System and method of monitoring and controlling application files

ABSTRACT

A system and method for updating, monitoring, and controlling applications on a workstation. The workstation includes a workstation management module configured to detect the launch or request to access a network by an application. A workstation application server receives data associated with the application from the workstation. The application server module can determine one or more policies or categories to associate with the application by referencing an application inventory database. Once the application server module has the category or policy, it forwards a hash/policy table to the workstation management module. Upon receipt of the hash/policy table, the workstation management module applies the policy that is associated with the application to control network access by the application.

RELATED CASES

This application is a continuation of application Ser. No. 12/403,313, filed on Mar. 12, 2009, issued as U.S. Pat. No. 8,150,817 B2, entitled SYSTEM AND METHOD OF MONITORING AND CONTROLLING APPLICATION FILES, which is a divisional of application Ser. No. 11/134,815, filed on May 19, 2005, now U.S. Pat. No. 7,529,754 B2, entitled SYSTEM AND METHOD OF MONITORING AND CONTROLLING APPLICATION FILES, which is a continuation-in-part of application Ser. No. 10/390,547, filed Mar. 14, 2003, issued as U.S. Pat. No. 7,185,015 B2, all of which are hereby expressly incorporated by reference in their entireties.

BACKGROUND

1. Field of the Invention

The invention is related to computing devices and, more particularly, to monitoring and controlling application files operating thereon.

2. Description of the Related Art

The Internet is a global system of computers that are linked together so that the various computers can communicate seamlessly with one another. Employees can access server computers to download and execute rogue programs and also operate peer-to-peer file sharing in the workplace, both of which pose new threats to an employer. For example, instant messaging (IM) can pose a security risk to an employer's company since many IM systems allow file transfer among computers. Because the employees can activate IM themselves, the employer does not know who sees sensitive data transmitted between the computers. However, IM can be a productive tool, when used in accordance with company policy. In addition, streaming media is a growing concern because of its drain on network bandwidth. Finally, employees that have illegal or unlicensed software on their workstations can present undesirable liability risks to the company because the company can be held responsible for the employee's use of the illegal or unlicensed software.

Software is available to manage how employees access the Internet in the workplace, preserving employee productivity, conserving network bandwidth and storage costs, limiting legal liabilities and improving network security. However, with the growth of the new threats described above, employers need new solutions to manage the broader intersection of employees with their computing environments.

SUMMARY

The systems and methods of the invention have several features, no single one of which is solely responsible for its desirable attributes. Without limiting the scope of the invention as expressed by the claims which follow, its more prominent features will now be discussed briefly. After considering this discussion, and particularly after reading the section entitled “Detailed Description of the Invention” one will understand how the features of the system and methods provide several advantages over traditional filter systems.

One aspect is a system for collecting network access data for use in updating a monitoring system which controls programs accessing a network. The system comprises a workstation configured such that a program resident thereon can access a network, a workstation management module coupled to the workstation and configured to detect the program accessing the network, determine whether the program is in a network access database, send program data associated with the program to an application server module if the program is not in the network access database, and apply one or more policies that are associated with the program, wherein the one or more policies are received from the application server module, and an application server module coupled to the workstation and configured to receive the program data from the workstation management module if the program was not in the network access database, determine whether the program is operating in a predetermined manner, if the program is not operating in a predetermined manner, then send the program data to an application database factory, if the program is operating in a predetermined manner, then provide the one or more policies associated with the program to the workstation management module.

Another aspect is a method of updating a system which controls operation of programs on a workstation. The method comprises detecting a network access attempt by an application, generating an application digest for the application, determining whether the application is associated with one or more policies, if the application is associated with one or more policies, then applying the one or more policies that are associated with the application, and if the application is not associated with one or more policies, then posting the application to a logging database. The method further comprises uploading the logging database to an application server module, determining whether the application is in an application inventory database, wherein the application is associated with one or more policies, and if the application is not in the application inventory database of the application server module, then posting the application to a network access database, if the application is in the application inventory database, then applying one or more policies associated with the application.

Yet another aspect is a method of collecting collection data for use in updating a system which controls network access of programs. The method comprises detecting an access request to a network by a program, determining whether the program is stored in a table, if the program is stored, applying a first rule that is associated with the program, and if the program is not stored, posting the program to a database.

Still another aspect is a method of updating a system which controls network access by programs on a workstation. The method comprises detecting a network access request of an application, generating a hash value for the application, wherein the hash value includes network access data, comparing the generated hash value to one or more hash values in a hash/policy table that includes one or more policies associated with the one or more hash values, if the generated hash value matches one or more of the hash values in the hash/policy table, then applying the one or more policies that are associated with the one or more hash values, and if the generated hash value does not match one or more hash values in the hash/policy table, then posting the application to a logging database. The method further comprises uploading the logging database to an application server module, determining whether the application from the logging database is in an application inventory database, and if the application is not in the application inventory database, then posting the application to a network access database.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram of a site collection system for controlling application files on a workstation.

FIG. 2 is a block diagram of a workstation management module.

FIG. 3 is a block diagram of an application server module.

FIG. 4A is an illustration of a database of parent groups and categories that can be associated with an application file.

FIG. 4B is an illustration of network access data that can be associated with an application file.

FIG. 5 is a block diagram of an application database factory.

FIG. 6 is an illustration of a screen shot of one embodiment of a graphical user interface (GUI) for an application analyst's classification module.

FIG. 7 is a flow diagram illustrating a process for monitoring and controlling the launch of an application on the workstation.

FIG. 8 is a flow diagram illustrating a process performed by the workstation for uploading and downloading collection data with the application server module.

FIG. 9 is a flow diagram illustrating a process performed by the application server module for uploading and downloading collection data with the workstation.

FIG. 10 is a flow diagram illustrating a process for classifying an uncategorized application at the application server module.

FIG. 11 is a flow diagram illustrating a process for uploading application data from the application server module to the application database factory.

FIG. 12 is a flow diagram illustrating a process for downloading application data from the application database factory to the application server module.

FIG. 13 is a flow diagram illustrating a process for classifying an uncategorized application at the application database factory.

FIG. 14 is a flow diagram illustrating a process for monitoring and controlling the behavior of a launched application on the workstation.

FIG. 15 is a flow diagram illustrating a process performed by the workstation for uploading and downloading collection data for network accessing applications with the application server module.

FIG. 16 is a flow diagram illustrating a process performed by the application server module for uploading and downloading collection data for network accessing applications with the workstation.

FIG. 17 is a flow diagram illustrating a process for analyzing network access data for a launched application at the application server module.

FIG. 18 is a flow diagram illustrating a process for uploading network access data from the application server module to the application database factory.

FIG. 19 is a flow diagram illustrating a process for downloading network access data from the application database factory to the application server module.

FIG. 20 is a flow diagram illustrating a process for analyzing the network access data associated with an application at the application database factory.

DETAILED DESCRIPTION OF THE INVENTION

The following detailed description is directed to certain specific embodiments of the invention. However, the invention can be embodied in a multitude of different systems and methods. In this description, reference is made to the drawings wherein like parts are designated with like numerals throughout.

In connection with the following description, many of the components of the various systems which may be included in the entire system, some of which are referred to as modules, can be implemented as software, firmware or a hardware component, such as a field programmable gate array (FPGA) or application specific integrated circuit (ASIC), which performs certain tasks. Such components or modules may be advantageously configured to reside on the addressable storage medium and configured to execute on one or more processors. Thus, a module may include, by way of example, components such as software components, object oriented software components, class components and task components, processes, functions, attributes, procedures, subroutines, segments of program code, drivers, firmware, microcode, circuitry, data, databases, data structures, tables, arrays and variables. The functionality provided for in the components and modules may be combined into fewer components and modules or further separated into additional components and modules. Additionally, the components and modules may advantageously be implemented to execute on one or more computers.

FIG. 1 is a block diagram of a local area network (LAN) 100 coupled to an Internet 108 and an application database factory 110, which is also coupled to the Internet 108. For ease of explanation, only a single LAN is shown, though two or numerous such networks would more typically be included. Similarly, two or more application database factories could also be deployed.

The LAN 100 includes one or more workstations 101 coupled to an application server module 102. The application server module 102 communicates via the Internet 108 in order to upload and download applications and application related data with the application database factory 110. The LAN 100 can have an Ethernet 10-base T topology, or be based on any networking protocol, including wireless networks, token ring network and the like.

The workstation 101 is coupled to the application server module 102. The workstation 101 can be a personal computer operating, for example, under the Microsoft Windows operating system; however, other computers, such as those manufactured by Apple or other systems, can be used.

The application server module 102 couples the LAN 100 with the Internet 108. The application server module 102 communicates with the Internet 108 via connection devices, such as routers or other data packet switching technology, for translating Internet TCP/IP protocols into the proper protocols for communicating with the Internet 108. The connection devices used to implement a given system can vary as well as its location within the LAN 100. For example, the connection devices could be located at the workstation(s) 101 or connected peripherally to the Internet 108. An exemplary connection device includes a firewall module (not shown) coupled to a router module (not shown).

FIG. 2 is a block diagram of the workstation management module 200 from the workstation 101 in FIG. 1. The workstation management module 200 can include an application digest generator 201, a client inventory module 202, an upload/download module 203, a hash/policy table 204, a logging database 206, a network access detection module 208, and an execution launch detection module 210.

The workstation management module 200 can detect the launch of an application on the workstation 101 and determine an access privilege for the workstation 101 and/or user. For example, an access privilege can include allowing the launched application to run on the workstation 101. Access privileges can be in the form of one or more policies or rules. To determine an access privilege for the workstation 101 and/or user, the workstation management module 200 can utilize a predetermined association between the launched application and one or more categories. The one or more categories can be further associated with the access privileges/policies or rules for the workstation 101 and/or user. Alternatively, the launched application is directly associated with an access privilege.

In addition to, or as an alternative to, detecting the launch of an application and determining whether to allow the application to run on the workstation 101, the workstation management module 200 can monitor the ongoing network activity or behavior of the application. After the application is allowed to run on the workstation 101, the application may or may not access the network. The workstation management module 200 monitors the ongoing behavior of the application even after the workstation management module 200 determines an access privilege for the application to run on the workstation. For example, each time an application that the workstation management module 200 has allowed to run on the workstation 101 attempts to access a network, the workstation management module 200 determines whether to allow the application to access the network. In this way, the workstation management module 200 can keep or change a previous access privilege based on the subsequent activity of the application. The access privilege relating to the determination of the workstation management module 200, as to whether to allow an application to launch on the workstation 101, can be stored with or separately from the access privilege that relates to the determination of the workstation management module 200 to allow the application to access the network.

In response to the application attempting to access the network, the workstation management module 200 can select a unique access privilege for the workstation 101 and/or user. The access privilege may be unique to every workstation or to multiple workstations. For example, the access privilege can include allowing the application to access the network or disallowing access to the network from one or more workstations. Access privileges can be in the form of one or more policies or rules.

To determine the access privilege for the workstation 101 and/or user, the workstation management module 200 can utilize a predetermined association between the application and an expected network behavior or activity for the application. This predetermined association is based upon prior or contemporaneous network activity for the application. For example, the expected network activity for an application running on a first workstation 101 can be determined from a record of that application's prior activity on the first workstation. In addition or in the alternative, the expected network activity for an application is determined from a record of that application's prior activity on multiple workstations.

The expected network activity can be determined from network activity by a different but related application. For example, the programs or applications from a single software company may have common access privileges. The access privilege associated with a later version of an application may share common access privileges with an earlier version of the same application.

The network activity of the same application running on different workstations can be weighted in a predetermined manner to determine an expected network activity for the application. The expected network activity can determine a common access privilege for multiple workstations. The workstation management module stores the expected network activity in the hash/policy table 204. In a preferred embodiment, the network activity from multiple workstations is uploaded to the application database factory 110. The access privilege can be determined at the application database factory 110.
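
By way of illustration only, the weighting of network activity observed on different workstations could be sketched in Python as follows. The weighting scheme is not specified above; this sketch assumes a simple weighted vote, and the names `expected_activity`, `observations`, and `weights` are hypothetical and introduced solely for this example.

```python
from collections import defaultdict

def expected_activity(observations, weights):
    """Pick the attribute combination with the highest total weight.

    observations: list of (workstation_id, attribute_tuple) pairs.
    weights: per-workstation weight; workstations not listed default to 1.0.
    Assumption: a weighted vote stands in for the "predetermined manner".
    """
    scores = defaultdict(float)
    for workstation_id, attributes in observations:
        scores[attributes] += weights.get(workstation_id, 1.0)
    # The combination with the largest accumulated weight is treated as expected.
    return max(scores, key=scores.get)

# Hypothetical observations of one application on three workstations.
observations = [
    ("ws1", ("TCP", 443)),
    ("ws2", ("UDP", 53)),
    ("ws3", ("TCP", 443)),
]
weights = {"ws1": 1.0, "ws2": 1.5, "ws3": 1.0}
expected = expected_activity(observations, weights)
```

In this sketch, the result could be stored as the expected network activity for the application and shared as a common basis for the access privilege on multiple workstations.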

The expected network activity for a given application can include one or more network attributes that are associated with the application. The attributes are associated with the application when the application accesses the network in an expected manner. These attributes can include, for example, a specific protocol, a specific I.P. address, and a specific access port. For example, the specific protocol for an application is listed in the hash/policy table 204. If the application requests access to the network using a different protocol than the expected protocol listed in the hash/policy table 204, the network access detection module 208 may disallow access.
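
As a non-limiting sketch, the comparison of requested network attributes against the expected attributes could be expressed in Python as shown below. The class `NetworkAttributes`, the table `EXPECTED_ATTRIBUTES`, the sample hash and address values, and the choice to disallow applications that have no listed attributes are all assumptions made for illustration.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class NetworkAttributes:
    protocol: str
    ip_address: str
    port: int

# Hypothetical excerpt of a hash/policy table: application hash -> expected attributes.
EXPECTED_ATTRIBUTES = {
    "a1b2c3": NetworkAttributes(protocol="TCP", ip_address="192.0.2.10", port=443),
}

def allow_network_access(app_hash: str, requested: NetworkAttributes) -> bool:
    """Allow access only when the requested attributes match the expected ones."""
    expected = EXPECTED_ATTRIBUTES.get(app_hash)
    if expected is None:
        # Assumption for this sketch: an application with no listed attributes is disallowed.
        return False
    return requested == expected
```

A request from the application using UDP instead of the listed TCP protocol would be disallowed under this sketch, consistent with the protocol example above.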

An application may request access to the network multiple times in a single day. However, one or more of the network attributes associated with the application may be different for each attempted access. In this way, the attributes of the application may change over time. The network access detection module 208 may allow a first combination of one or more network attributes while disallowing a second combination of the one or more network attributes.

Each combination of the one or more attributes can be associated with one or more categories. The one or more categories can be further associated with the policies or rules for the workstation 101 and/or user.
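
For illustration only, the association of attribute combinations with categories, and of categories with policies, could be sketched in Python as follows. The category names, the sample addresses, and the names `COMBINATION_CATEGORIES`, `CATEGORY_POLICY`, and `policies_for` are hypothetical.

```python
# Hypothetical mapping: (protocol, I.P. address, port) -> categories.
COMBINATION_CATEGORIES = {
    ("TCP", "192.0.2.10", 443): ["software update"],
    ("UDP", "198.51.100.7", 6881): ["peer-to-peer file sharing"],
}

# Hypothetical mapping: category -> policy for a given workstation and/or user.
CATEGORY_POLICY = {
    "software update": "allow",
    "peer-to-peer file sharing": "deny",
}

def policies_for(combination):
    """Return the policies reached through the categories of a combination."""
    categories = COMBINATION_CATEGORIES.get(combination, [])
    return [CATEGORY_POLICY[category] for category in categories]
```

Under these assumed tables, the same application could be allowed for a first combination of attributes and disallowed for a second combination.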

When a program or application on a computer or workstation is launched, the execution launch detection module 210 detects the launch. In response to this detection, the workstation management module 200 determines whether to allow or disallow the application to run on the workstation 101.

An application that the workstation management module 200 allows to run on the workstation 101 may or may not request access to a network. For an application that does request access to a network, the application may request access at launch or after the program is running on the workstation 101. For example, once a publisher's application is launched, the application may request access over a network to that publisher's website for updates. Continuing with this example, the hash associated with the application and the combination of one or more network attributes associated with the network access request are compared to the hash/policy table 204 to select a policy or rule. The hash/policy table 204 can further include categories. A category can then be associated with the policy or rules.

When an application or program accesses a network, the network access detection module 208 monitors the behavior or activity of the application or program. The execution launch detection module 210 and the network access detection module 208 direct the application digest generator 201 to analyze data related to a requested application or data related to a network accessing application. As part of its analysis, the execution launch detection module 210 can generate a hash for the application using the application digest generator 201. The application digest generator 201 parses properties from the requested application to uniquely identify the application. These properties can include network access data. Exemplary properties include the name, publisher, suite, hash, file size, version, protocol, I.P. address, port, and additional information or properties which are associated with a launched or network accessing application.

The hash for the application is determined by transforming the binary associated with the application into a unique set of bits. A hash function, which is a one-way cryptographic transformation known in the art, is employed in determining the hash for the application. In this way, the hash function takes selected binary input from the application and transforms the binary into a fixed-length output called a hash. The result is a hash with a fixed-size set of bits that serves as a unique “digital fingerprint” for the application. Two exemplary hash algorithms include MD-5 and Secure Hash Algorithm-1 (SHA-1). The MD-5 hash algorithm produces a 128-bit output hash. The SHA-1 algorithm produces a 160-bit output hash.
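
By way of example only, the generation of a fixed-length hash from an application binary could be sketched in Python using the standard `hashlib` module. The function name `application_digest` and the sample byte string are assumptions for illustration; an actual embodiment would read the selected binary input from the application file.

```python
import hashlib

def application_digest(binary: bytes, algorithm: str = "sha1") -> str:
    """Return a fixed-length hexadecimal digest of an application's binary."""
    hasher = hashlib.new(algorithm)
    hasher.update(binary)
    return hasher.hexdigest()

# Hypothetical stand-in for the binary contents of an application file.
sample = b"example application binary"
md5_digest = application_digest(sample, "md5")    # 32 hex characters = 128 bits
sha1_digest = application_digest(sample, "sha1")  # 40 hex characters = 160 bits
```

Each hexadecimal character encodes four bits, so the digest lengths in this sketch correspond to the 128-bit and 160-bit outputs noted above.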

The parsed properties or attributes are provided to the execution launch detection module 210 and/or the network access detection module 208. For launched applications, the execution launch detection module 210 analyzes the application request from the workstation 101 and then compares the application request with the hash/policy table 204. For applications requesting to access a network, the network access detection module 208 analyzes the network access request from the workstation 101 and then compares the network access request with the hash/policy table 204. The hash/policy table 204 includes one or more predetermined network attributes and one or more policies associated therewith. As will be explained with reference to FIG. 3, the application server module 102 provides the hash/policy table 204 to the workstation management module 200.

The hash/policy table 204 is received from the application server module 102. The hash/policy table 204 can include a list of application names, publishers, suites, hashes, ports, protocols, I.P. addresses, categories, and rules or policies associated therewith. In one embodiment, the one or more parsed properties in the hash/policy table 204 include a list of hash values. Continuing with this embodiment, the hash/policy table 204 further includes a list of policies that are associated with the hash values in the list. In addition to hash values and policies in this embodiment, the hash/policy table 204 could further include a list of categories that are associated with the hash values and/or policies. Moreover, in another embodiment, the hash/policy table 204 does not include hash values. Instead, the hash/policy table 204 includes the names/publishers/suites or other properties which identify the applications in the hash/policy table 204. In still another embodiment, the hash/policy table 204 includes the port/I.P. address/protocol or other properties which identify the applications in the hash/policy table 204.
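
As a non-limiting sketch, an entry of the hash/policy table and two of the identification embodiments described above (identification by hash value, and identification by name and publisher) could be represented in Python as follows. The names `TableEntry` and `build_index` and all sample values are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class TableEntry:
    name: str
    publisher: str
    policies: List[str]
    hash_value: Optional[str] = None          # absent in embodiments without hashes
    categories: List[str] = field(default_factory=list)

def build_index(entries, by_hash=True):
    """Index entries by hash value, or by (name, publisher) when hashes are not used."""
    index = {}
    for entry in entries:
        key = entry.hash_value if by_hash else (entry.name, entry.publisher)
        if key is not None:
            index[key] = entry
    return index

# Hypothetical table contents.
entries = [
    TableEntry("ExampleChat", "Example Co", ["deny_network"], "a1b2c3", ["instant messaging"]),
    TableEntry("ExampleEditor", "Example Co", ["allow_execution"]),
]
hash_index = build_index(entries, by_hash=True)
name_index = build_index(entries, by_hash=False)
```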

Once the application is requested to run on the workstation or when the application requests to access the network, the policy from the hash/policy table 204 which corresponds to that application is also identified. The execution launch detection module 210 or the network access detection module 208 compares the properties of the application to the properties in the hash/policy table 204 to determine what access privileges or policies should be applied to the request to run the application or to access the network. These policies or rules can include, for example, allowing execution of the program, allowing access to the network, denying execution of the program, denying access to the network, alerting the user that the request to run the application will be logged, alerting the user that the request to access the network will be logged, allowing the user a specific amount of time in which to run the application, and allowing the user a specific amount of time in which to access the network.

In addition to the policies and rules listed above, the workstation management module 200 can employ other actions, cumulatively referred to as selectable filters, in response to a request to run the application or to a request for an application to access a network. Examples of selectable filters include postponing the running of the application, postponing access to the network, allowing the user to override denial to run the application, allowing the user to override denial to access the network, limiting the user's access to the application based on a quota, limiting the user's access to the network based on a quota, limiting the user's access to the application based on a network load, and limiting the user's access to the network based on a network load. Each requested application or network accessing application can be associated with one or more policies or rules.

In one embodiment, the execution launch detection module 210 or the network access detection module 208 checks to see if the generated hash matches any hashes stored in the hash/policy table 204. If a match between the requested application and a hash in the hash/policy table 204 is found, the execution launch detection module 210 or the network access detection module 208 applies the policy(s)/rule(s) associated with the hash that matches the application and/or the user requesting the application or network access. For example, if application of the rule by the execution launch detection module 210 indicates that the requested application is not allowed to run on the workstation 101 or to be run by the user, a predefined block page can be sent to the user interface explaining that the requested application is not allowed to run and why. Alternatively, the execution launch detection module 210 simply stops the requested application from running on the workstation 101.

For example, if application of the rule by the network access detection module 208 indicates that the network access requested by the application is not allowed, a predefined block page can be sent to the user interface explaining that the requested application is not allowed to access the network and why. Alternatively, the network access detection module 208 simply stops the requested application from accessing the network.
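
For illustration only, the matching of a generated hash against the hash/policy table and the resulting block page at launch time could be sketched in Python as follows. The table contents, the policy names, the block page wording, and the function `handle_launch` are assumptions introduced for this example.

```python
# Hypothetical hash/policy table: application hash -> policy.
HASH_POLICY_TABLE = {
    "3f2a": "deny_execution",
    "9c1b": "allow_execution",
}

def handle_launch(app_hash: str) -> dict:
    """Apply the policy associated with a matching hash, if any."""
    policy = HASH_POLICY_TABLE.get(app_hash)
    if policy is None:
        # No match: the decision is left to the handling of uncategorized applications.
        return {"matched": False, "run": None, "block_page": None}
    if policy == "deny_execution":
        return {
            "matched": True,
            "run": False,
            "block_page": "The requested application is not allowed to run: denied by policy.",
        }
    return {"matched": True, "run": True, "block_page": None}
```

A request to access the network could be handled in the same manner, with the network access detection module consulting network-related policies in place of execution policies.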

If the execution launch detection module 210 or the network access detection module 208 does not find the application hash in the hash/policy table 204 (for example, the application is uncategorized or the application is behaving unexpectedly), the module 208, 210 then determines how to proceed with the application. For example, running of the application could be allowed when the execution launch detection module 210 or the network access detection module 208 determines that the application requested is uncategorized or behaving unexpectedly. Alternatively, the execution launch detection module 210 or the network access detection module 208 can stop execution or network access for the application depending on a policy associated with the user at this workstation.
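
As a minimal sketch, the handling of an application whose hash is not found could depend on a per-user default, as shown below in Python. The user names, the mapping `USER_DEFAULT_ACTION`, and the fallback of allowing the application are hypothetical choices made for this example.

```python
# Hypothetical per-user policy for uncategorized or unexpectedly behaving applications.
USER_DEFAULT_ACTION = {
    "alice": "allow",
    "bob": "deny",
}

def decide_uncategorized(user: str, fallback: str = "allow") -> str:
    """Return the action to take when no hash/policy table entry matches."""
    # Assumption: users without an explicit setting receive the fallback action.
    return USER_DEFAULT_ACTION.get(user, fallback)
```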

The one or more policies identified for the requested application are applied in response to the request to run the application. In this way, the execution launch detection module 210 or the network access detection module 208 filters each request to run an application using the parsed properties, the hash/policy table 204, and the policies/rules from the hash/policy table. A policy can be provided and utilized even if the application is not found in the hash/policy table 204.

If the requested application is found in the hash/policy table 204, the event is logged in the logging database 206. Information that is logged in the logging database 206 can include, for example, the application name, time of day, port, I.P. address, protocol, and the hash associated with the application. The logging database 206 can also include additional data associated with the application. For example, a request frequency or a time of execution for the application requested can be included in the logging database 206.

If the hash of the uncategorized application is not represented in the logging database 206, the execution launch detection module 210 can store the application name, hash, and information parsed by the application digest generator 201 in the logging database 206. In this way, the logging database 206 can include additional information associated with the requested application. For example, the publisher, suite, file size, hash, protocol, I.P. address, port, directory location, and the like can be included in the logging database 206.
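
By way of illustration only, a logging database that stores an application once per hash, together with its parsed properties and a request count, could be sketched in Python as follows. The class `LoggingDatabase`, its field names, and the in-memory dictionary storage are assumptions for this example.

```python
import time

class LoggingDatabase:
    """In-memory stand-in for a logging database keyed by application hash."""

    def __init__(self):
        self.entries = {}

    def log_request(self, app_hash, name, **properties):
        entry = self.entries.get(app_hash)
        if entry is None:
            # First time this hash is seen: store the name, hash, and parsed properties.
            entry = {"name": name, "hash": app_hash, "request_count": 0, **properties}
            self.entries[app_hash] = entry
        # Every request updates the frequency and the time of the latest request.
        entry["request_count"] += 1
        entry["last_requested"] = time.time()
        return entry
```

In this sketch, properties such as the publisher, port, or protocol would be passed as keyword arguments on the first request, and the request count provides the request frequency mentioned above.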

Still referring to FIG. 2, in one embodiment, the client inventory module 202 is configured to inventory the applications on the workstation 101. To that end, the client inventory module 202 can access the hash/policy table 204 to determine whether the applications on the workstation 101 are classified and/or uncategorized. The client inventory module 202 can be configured to perform the inventory of the workstation 101 on a periodic basis. For example, the client inventory module 202 can inventory the applications on the workstation 101 once a day or on any other interval selected. Advantageously, the client inventory module 202 can perform the inventory during non-working hours. The inventory can be determined when the workstation 101 is powered up by the user or powered down by the user. Depending on the configuration of the LAN 100, a network administrator can instruct the client inventory module 202 to perform the inventory. In addition, the inventory can be performed in response to polling by the application server module 102 (see FIG. 1).
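
As a non-limiting sketch, the inventory performed by the client inventory module could separate the applications on a workstation into those found in the hash/policy table and those that are uncategorized. The function name `inventory` and the sample hashes are hypothetical.

```python
def inventory(installed_hashes, hash_policy_table):
    """Split installed application hashes into categorized and uncategorized lists."""
    categorized = sorted(h for h in installed_hashes if h in hash_policy_table)
    uncategorized = sorted(h for h in installed_hashes if h not in hash_policy_table)
    return categorized, uncategorized

# Hypothetical workstation contents and table.
installed = {"h1", "h2", "h3"}
table = {"h1": "allow", "h3": "deny"}
categorized, uncategorized = inventory(installed, table)
```

Such a function could be invoked on whatever schedule is selected, for example once a day or in response to polling.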

Still referring to FIG. 2, the upload/download module 203 can transmit data to and receive data from the application server module 102 (see FIG. 1). For example, the upload/download module 203 can transmit data from the logging database 206 to the application server module 102. In an embodiment where the client inventory module 202 performs an inventory of the applications on the workstation 101, the results of the inventory can be uploaded to the application server module 102 by the upload/download module 203.

The upload performed by the upload/download module 203 can be immediate or periodic depending on the desires of the network administrator. For example, a daily upload after normal business hours could be used. The upload/download module 203 can compute the request frequency from scanning the logging database 206, to prioritize the applications in the logging database 206 for their transmission to the application server module 102. In another embodiment, a frequency count database (not shown) is updated for each entry in the logging database 206. The frequency count database maintains the request frequency for each entry in the logging database 206. In this embodiment, the upload/download module 203 accesses the frequency count database to prioritize the applications.

If data from the logging database 206 is to be uploaded to the application server module 102, the upload/download module 203 can refer to a request frequency for applications found from scanning the logging database 206. The request frequency can be used to prioritize the applications in the logging database 206 for their transmission to the application server module 102.
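By way of a non-limiting illustration, the prioritization described above can be sketched as follows. The sketch assumes that each entry in the logging database 206 is a record carrying the hash of the requested application; the record layout and the function name are hypothetical and are not taken from the described embodiments.

```python
from collections import Counter

def prioritize_by_request_frequency(log_entries):
    """Return (application hash, request count) pairs, most requested first."""
    # Hypothetical record layout: each log entry is a dict with a "hash" key.
    counts = Counter(entry["hash"] for entry in log_entries)
    # Sort by descending count; break ties by hash so the order is deterministic.
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))

log = [
    {"hash": "a1", "event": "launch"},
    {"hash": "b2", "event": "launch"},
    {"hash": "a1", "event": "network"},
]
upload_order = prioritize_by_request_frequency(log)
```

In this sketch the application requested most often is placed first in the upload order. An embodiment using a separate frequency count database could maintain the same counts incrementally instead of scanning the log.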

FIG. 3 is a block diagram of an application server module 102 which communicates with the workstation management module 200 (FIG. 2) to upload and download a list of applications comprising properties of applications as well as policies associated with the applications once categorized. These properties can include network access data associated with each application. For example, parsed properties from requested applications or network accessing applications can be uploaded to the application server module 102 while a list of hash values and policies associated therewith are downloaded to the workstation management module 200. In addition, the category associated with the application can be transmitted to the workstation management module 200. If the category associated with the application is available to the workstation management module 200, the workstation management module can select the access privilege for the workstation and/or user that corresponds to the one or more categories associated with the application. When more than one category is associated with the application and the categories have different policies associated thereto, one or both rules/policies can be used for the access privilege.

The application server module 102 can include an application inventory database 103, a workstation upload/download module 104, a factory upload/download module 105, a classification user interface 106, and a policy database 109. The application inventory database 103 can further include an uncategorized application database 108 and a network access database 107. Alternatively, the uncategorized application database 108 and/or the network access database 107 are combined into a single database or can be separate databases from the application inventory database 103.

The network access database 107 includes network access data associated with the application. The network access data includes parsed properties or attributes that are associated with the application when the application accesses the network. The network access data is uploaded to the application server module 102. The uploaded network access data can be compared to expected network access data for the application. As explained above, the expected network access data can be a compilation of network access data associated with contemporaneous or prior network access by the application. The expected network access data can be compiled from the requesting workstation or from other workstations.

The expected network access data is derived from network access data obtained when the application is behaving in a predetermined manner. The uploaded network access data allows the application server module 102 to monitor the behavior of categorized and uncategorized applications. The categorized application is monitored each time the application accesses the network. A categorized application that does not operate in a predetermined manner is placed in the uncategorized applications database 108. In this way, the same application may have different entries in the application inventory database 103. For example, a first entry in the application inventory database 103 corresponds to the application hash and parsed properties or features associated with the application when the application is operating in a predetermined manner. These expected features relate to normal network activity by the application. A second entry corresponds to the same application hash but different parsed properties or features associated with the application when the application is not operating in a predetermined manner.

The network administrator, or the like, interfaces with the application server module 102 via the classification user interface 106. The network administrator can classify uncategorized applications and/or recategorize previously categorized applications. The uncategorized applications can include applications that are not categorized and/or applications that are categorized but are not operating in a predetermined or expected manner. In the latter case, the network access detection module 208 looks to the network attributes when determining whether the application is operating in a predetermined or expected manner.

The network administrator receives the data from the application inventory database 103 via the classification user interface 106. The network administrator can further interface through the classification user interface 106 to select or create access privileges/policies/rules for users, workstation, and/or groups of users/workstations. These rules are stored in the policy database 109. These rules can include, for example, allowing applications associated with selected categories to execute on a given workstation 101. These rules can also include, for example, allowing the application to access the network from a given workstation 101. Rules can also include selectable filters. For example, rather than simply not allowing the application to execute or to access the network, the network administrator may select or create a selectable filter which is applied when the application is requested to run or access the network. The rules are provided to the workstation management module 200 via the workstation upload/download module 104. In this way, the execution launch detection module 210 and/or the network access detection module 208 (see FIG. 2) apply the rule that is associated with the category or network access data of the application.

One function of the workstation upload/download module 104 is to receive identifiers for the application names and any additional data or parsed properties which are associated with the application names from the workstation management module 200. For example, the identifier for an application name could be a hash value or the name of the application itself. In one embodiment, the application names include names from the logging database 206. The additional data can also include a request frequency for an application found in the logging database 206, a trace ID, a primary language used by the workstation management module 200, source IP address, destination IP address, source port number, destination port number and other network access data. For ease of explanation, the term “collection data” will be used to include applications and any additional data associated with the application. Additionally, the workstation upload/download module 104 downloads all or portions of the application inventory database 103 to the workstation management module 200 as will be described more fully below.

The workstation upload/download module 104 receives the collection data from the upload/download module 203 (see FIG. 2) and processes the collection data. Processing can include merging and sorting the collection data from multiple workstation management modules. The workstation upload/download module 104 determines whether each application in the collection data requires categorization. For example, an application that is not operating in a predetermined manner may require categorization.

If an application has not been previously categorized or if the application has been previously categorized but is not operating in a predetermined manner, the collection data associated with that application is stored in the uncategorized application database 108. Thus, an application name which is found in the application inventory database 103 can also be stored in the uncategorized applications database 108 if that application is not operating in a predetermined manner. For applications that are not operating in a predetermined manner, the collection data can be further stored in the network access database 107. As explained above, the network access database 107 can be separate from or combined with the uncategorized applications database 108.
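As one possible sketch of this routing decision, the following assumes that the application inventory database 103 can be modeled as a mapping from an application hash to the set of expected network accesses, each access being a (protocol, destination port, destination IP address) tuple. The data model and names are hypothetical and serve only to illustrate the two conditions described above.

```python
def route_collection_data(record, inventory, uncategorized):
    """Store the record for review when needed; return True if it was stored."""
    expected = inventory.get(record["hash"])
    # Condition 1: the application has not been previously categorized.
    never_categorized = expected is None
    # Condition 2: it is categorized but not operating in the expected manner.
    unexpected = not never_categorized and record["access"] not in expected
    if never_categorized or unexpected:
        uncategorized.append(record)
        return True
    return False

inventory = {"a1": {("TCP", 80, "207.46.248.112")}}
uncategorized = []
route_collection_data(
    {"hash": "a1", "access": ("TCP", 80, "207.46.248.112")}, inventory, uncategorized)
route_collection_data(
    {"hash": "a1", "access": ("UDP", 4000, "207.46.248.112")}, inventory, uncategorized)
route_collection_data(
    {"hash": "zz", "access": ("TCP", 443, "207.46.248.112")}, inventory, uncategorized)
```

Here the first record matches the expected behavior and is not stored, while the second (same hash, unexpected access) and the third (unknown hash) are both placed in the list standing in for the uncategorized application database 108.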

The network administrator receives the collection data (for example, application information and any additional data associated with the application) from the uncategorized application database 108 and/or the network access database 107. The network administrator, via the classification user interface 106, is then able to categorize the uncategorized application and/or associate a policy with the category or application. Applications that the network administrator determines are not operating in a predetermined manner are categorized or re-categorized. The application is re-categorized based on parsed properties or features which are different than the parsed properties or features associated with the original categorization of the application. Once categorized or re-categorized, the application is stored in the application inventory database 103. The application inventory database 103 may include one or more entries for the application when the application is operating in a predetermined manner as well as one or more entries for the same application when the application is not operating in a predetermined manner. As will be described below, if the network administrator does not classify the application, the application database factory 110 can classify the collection data.

Once the application has been classified or categorized by the network administrator, the application and the associated category are posted to the application inventory database 103. The workstation upload/download module 104 thereafter routinely copies the application inventory database 103 or a portion thereof to the workstation management module 200 (see FIG. 2). For example, data from the application inventory database 103 can be copied to the hash/policy table 204. The policies in the policy database 109 can be incorporated into the downloaded data from the application inventory database 103 or downloaded separately from the application inventory database 103. As can be imagined, the system can include thousands of workstation management modules 200, each of which is updated regularly by the workstation upload/download module 104 to provide updated data to the hash/policy table 204. In some embodiments, the workstation upload/download module 104 transfers portions of the application inventory database 103. For example, the workstation management module 200 can receive updates so that the entire database need not be transmitted. In other embodiments, the workstation management module 200 receives a subset of the data from the application inventory database 103. For example, the selected data could be the hash values. The policies from the policy database 109 could then be incorporated with the hash values and downloaded to the workstation management module 200. Flowcharts of the process performed by the application server module 102 are shown in, and will be described with reference to, FIGS. 9 and 16.
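A minimal sketch of such a partial update follows, assuming the hash/policy table 204 is held as a mapping from hash value to policy and that an update carries entries to add or change plus hashes to delete. The update format, including the "upsert" and "remove" keys, is a hypothetical choice made for illustration only.

```python
def apply_update(hash_policy_table, update):
    """Apply an incremental update in place and return the table."""
    # Add new entries and overwrite changed ones.
    for app_hash, policy in update.get("upsert", {}).items():
        hash_policy_table[app_hash] = policy
    # Drop entries that the server no longer distributes (assumed feature).
    for app_hash in update.get("remove", []):
        hash_policy_table.pop(app_hash, None)
    return hash_policy_table

table = {"a1": "allow", "b2": "deny"}
apply_update(table, {"upsert": {"b2": "allow", "c3": "alert"}, "remove": ["a1"]})
```

Only the changed entries travel over the network in this sketch, which is the point of transmitting updates rather than the entire database.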

Still with reference to FIG. 3, the factory upload/download module 105 is configured to transmit data from the application inventory database 103 to the application database factory 110. The upload could be immediate or periodic depending on the level of service required by the network administrator. For example, a daily upload after normal business hours could be used. The factory upload/download module 105 can refer to request frequency or associated network access data to prioritize the applications in the application inventory database 103 for their transmission to the application database factory 110. The factory upload/download module 105 can refer to the uncategorized application database 108 and/or the network access database 107 to select collection data for uploading to the application database factory 110. If data from the uncategorized application database 108 or the network access database 107 is to be uploaded to the application database factory 110, the factory upload/download module 105 can refer to a request frequency to select applications from the uncategorized application database 108 for uploading to the application database factory 110. In this way, the request frequency can be used to prioritize the applications in the uncategorized application database 108 or the network access database 107 for their transmission to the application database factory 110.

The factory upload/download module 105 can further upload applications that have been classified by the network administrator. As described above, the network administrator can classify or categorize applications via the classification user interface 106. In this way, the application database factory 110 receives the newly classified applications from the application server module 102. As can be imagined, the application database factory 110 can receive applications and associated categories from thousands of application server modules 102.

The workstation upload/download module 104 can receive an inventory taken by the client inventory module 202 from the upload/download module 203 (see FIG. 2). Once uploaded to the application server module 102, the network administrator can review one or more inventories to determine what applications are being used by each workstation 101. The network administrator can review one or more inventories to determine whether the categorized application is operating in a predetermined manner. The inventory can include categorized as well as uncategorized applications. Depending on the configuration of the LAN 100, the network administrator can review the one or more inventories at the workstation management module 200 (see FIG. 2).

FIG. 4A is an illustration of one embodiment of a database of parent groups and categories that are associated with the applications. In the illustrated embodiment, one or more of the categories listed in the database are further associated with risk classes. Examples of risk classes include security, liability, and productivity. The risk classes can be useful to the network administrator when associating rules/policies with each application. Moreover, in some embodiments each rule/policy is associated with the applications based on the risk class that is associated with each category.

Still referring to FIG. 4A, exemplary categories of applications include operating systems, anti-virus software, contact managers, collaboration, media players, adult, and malicious applets and scripts. The categories can be further grouped into parent groups. For example, parent groups might include system, access/privacy, productivity, communication, audio/video, entertainment, and malware. For each one of the parent groups and/or categories, the network administrator can select an individual policy or rule to associate therewith. Thus, once the requested application is categorized, the application server module 102 can select the policy or rule that is associated with that category.
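The selection of a policy from a category or its parent group can be sketched as below. The category and parent group names are taken from the examples above, but the particular category-to-parent assignments, the policy values, and the rule that a category-level policy takes precedence over a parent-group policy are assumptions made for illustration.

```python
# Assumed assignment of example categories to example parent groups.
PARENT_GROUP = {
    "anti-virus software": "access/privacy",
    "media players": "audio/video",
    "malicious applets and scripts": "malware",
}
# Hypothetical policies chosen by a network administrator.
CATEGORY_POLICY = {"media players": "alert"}
PARENT_POLICY = {"malware": "deny", "access/privacy": "allow"}
DEFAULT_POLICY = "allow"

def policy_for_category(category):
    """Prefer a category-level policy, then the parent group's, then a default."""
    if category in CATEGORY_POLICY:
        return CATEGORY_POLICY[category]
    parent = PARENT_GROUP.get(category)
    return PARENT_POLICY.get(parent, DEFAULT_POLICY)
```

For example, `policy_for_category("malicious applets and scripts")` falls through to the parent group "malware" and yields the "deny" policy in this sketch.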

FIG. 4B is an illustration of network access data that can be associated with an application file. In the illustrated embodiment, each masked hash value corresponds to an application accessing the network. The parsed properties or features associated with the applications can include a source IP address, destination IP address, source port number, destination port number and other network access data. In FIG. 4B, these parsed properties include the transport protocol, destination port, and destination I.P. address. The application corresponding to the hash “aafd61a161ae747844bf128d1b61747a95472570” employed the Transmission Control Protocol (“TCP”) as its transport protocol, port “80”, and destination I.P. address “207.46.248.112.” User Datagram Protocol (“UDP”) is another transport protocol that can be used. The network access data for the application allows the network administrator to discriminate between expected/predetermined behavior and unexpected behavior of the application. Different rules/policies can be applied to multiple entries for the same application depending on the network access data that is associated with each entry.
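One way to picture multiple entries for the same application is a table keyed by the hash together with the parsed network properties. The sketch below uses the hash, protocol, port, and address from the FIG. 4B example; the policy values and the fallback policy for an access that matches no entry are hypothetical.

```python
APP_HASH = "aafd61a161ae747844bf128d1b61747a95472570"

# Hypothetical entries: (hash, protocol, destination port, destination IP) -> policy.
ENTRIES = {
    (APP_HASH, "TCP", 80, "207.46.248.112"): "allow",
    (APP_HASH, "UDP", 80, "207.46.248.112"): "alert",
}
# Assumed policy for an access that matches no entry.
UNEXPECTED_ACCESS_POLICY = "deny"

def network_policy(app_hash, protocol, dest_port, dest_ip):
    """Return the policy for this specific network access by the application."""
    key = (app_hash, protocol, dest_port, dest_ip)
    return ENTRIES.get(key, UNEXPECTED_ACCESS_POLICY)
```

The same application hash thus maps to different policies depending on the network access data recorded with each entry.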

FIG. 5 is a block diagram of the application database factory 110 connected to the Internet 108. The application database factory can be implemented as one or more computers or servers with related data storage. The application database factory 110 provides the application inventory database to the application server module 102 and processes data that is associated with uncategorized applications and other information. Uncategorized applications include applications previously categorized that are not operating in a predetermined manner. The other information may include frequency usage from the application inventory database 103. In one embodiment, the application database factory 110 receives uncategorized applications and any additional data associated with the application from the application server module 102 and downloads categorized applications to the application server module. The application database factory 110 can also upload the request frequency for the applications.

The application database factory 110 can include an upload/download module 301, a master application database 300, and an application analyst's classification module 302. The master application database 300 can further include an uncategorized applications database 303 and/or a network access database 304. Alternatively, the uncategorized applications database 303 and/or the network access database 304 are combined into a single database or can be separate databases from the master application database 300.

One function of the upload/download module 301 is to receive collection data (for example, applications and any additional data associated with the application) from the application server module 102. In one embodiment, the collection data includes applications from the uncategorized application database 108, the network access database 107, and the application inventory database 103. The collection data can include a request frequency for an application found in the application inventory database 103 (see FIG. 3), a request frequency for an application found in the uncategorized application database 108, a trace ID, a destination port number, other network access data, and a primary language used by the application server module 102.

The upload/download module 301 receives the collection data from the factory upload/download module 105. The upload/download module 301 processes the collection data. Processing can include merging, sorting, and determining a language for the collection data from multiple application server modules 102. The upload/download module 301 determines whether each application in the collection data requires categorization. If the application has not been previously categorized or if the application is not operating in a predetermined manner, the application analyst's classification module 302 receives the application and any additional data associated with the application from the upload/download module 301.

The application analyst classification module 302 is coupled to the master application database 300. The application analyst classification module 302 is configured to manipulate and manage data from the master application database 300. The application analyst classification module 302 receives applications and their associated data from the master application database 300. The associated data can include, for example, an IP address, a publisher and suite that correspond to the application.

The application analyst's classification module 302 classifies or categorizes applications which are then added to the master application database 300 of categorized applications. A human reviewer interacts with the application analyst's classification module 302 to perform the categorization or recategorization. The process for classifying or categorizing applications at the application database factory is described with reference to FIGS. 13 and 20.

For a human reviewer, a set of PC-based software tools can enable the human reviewer to manipulate, scrutinize, and otherwise manage the applications from the master application database 300. The human reviewer can interact with the application analyst classification module 302 via a graphical user interface (GUI). In this way, the GUI provides a graphical interface tool for the human reviewer to manipulate and manage the master application database 300. The GUI includes a representation of the application ID and the related textual information. The GUI can include buttons preloaded with algorithmically derived hints to enhance productivity of the human reviewer. These identities can be selected based on, for example, the URL that is identified as the source of the application. An exemplary GUI will be described below with reference to FIG. 6.

The application analyst's classification module 302 is configured to select applications and their associated data from the master application database 300. The application analyst classification module 302 can apply rules to select a subset of applications from the master application database 300. These rules can be dependent upon, for example, categories, languages, suites, dates, and source directories. The application analyst classification module 302 can use SQL queries, in conjunction with the rules, to select the subset for categorization or recategorization from the master application database 300.
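For illustration only, a rule-driven SQL selection of this kind might resemble the following sketch, which uses an in-memory SQLite database in place of the master application database 300. The table name, column names, and sample rows other than the FIG. 6 driver name are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE applications ("
    "hash TEXT, name TEXT, category TEXT, language TEXT, suite TEXT)"
)
conn.executemany(
    "INSERT INTO applications VALUES (?, ?, ?, ?, ?)",
    [
        ("a1", "CMD PCI IDE Bus Driver", None, "en",
         "Microsoft Windows Operating System"),
        ("b2", "Example Player", "media players", "en", "Example Suite"),
        ("c3", "Beispiel Werkzeug", None, "de", "Example Suite"),
    ],
)

def select_for_review(connection, language):
    """Select uncategorized applications in one language, ordered by name."""
    rows = connection.execute(
        "SELECT hash, name FROM applications "
        "WHERE category IS NULL AND language = ? ORDER BY name",
        (language,),
    )
    return rows.fetchall()

english_queue = select_for_review(conn, "en")
```

The rule here (uncategorized and in a given language) is only one example; rules based on suites, dates, or source directories would add further conditions to the WHERE clause.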

The application analyst classification module 302 can analyze each application, the collection data, any text objects associated with the application, any additional data associated with the application, and any additional data retrieved independent of the collection data to determine one or more appropriate categories. Exemplary independent data includes data from an Internet search that utilizes the collection data. Categorization can be based upon word analysis, adaptive learning systems, and image analysis.

In one embodiment, the application analyst classification module 302 accesses the Internet 108 and performs a search based on the application and the collection data. In one embodiment, a GUI button preloaded with the publisher of the application is selected by the human reviewer to initiate an Internet search. The Internet search can provide the application analyst's classification module 302 with additional information for categorizing the application. For example, the search can identify a uniform resource locator (URL) which is the address of a computer or a document on the Internet that is relevant to the categorization process for the application. The URL consists of a communications protocol followed by a colon and two slashes (e.g., http://), the identifier of a computer, and usually a path through a directory to a file. The identifier of the computer can be in the form of a domain name, for example, www.m-w.com, or an Internet protocol (I.P.) address, for example, 123.456.789.1. There are often addresses, components thereof (for example, I.P. address, domain name, and communication protocol), or other location identifiers that can be used to identify computers or documents on the Internet. For ease of description, the term URL is used hereafter to refer to any of these addresses or identifiers. The application analyst's classification module 302 can utilize the hash and/or URL associated with the application to aid in categorizing the application.
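The URL components named above can be separated with a standard URL parser, as in the following sketch. The domain name is the example given above; the path portion of the sample URL and the function name are hypothetical.

```python
from urllib.parse import urlsplit

def url_components(url):
    """Split a URL into its protocol, computer identifier, and path."""
    parts = urlsplit(url)
    return {
        "protocol": parts.scheme,   # communications protocol, e.g. "http"
        "host": parts.hostname,     # domain name or I.P. address
        "path": parts.path,         # path through a directory to a file
    }

components = url_components("http://www.m-w.com/dictionary/index.htm")
```

Any one of these components, such as the host alone, could then serve as the location identifier used in the categorization process.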

Once the application is categorized, the application analyst classification module 302 posts the application along with its associated one or more categories into the master application database 300. The master application database 300 can include applications and their associated categories. The master application database 300 can be stored in a relational database management system, such as Oracle, Sybase, Informix, Microsoft SQL Server, or Access. A text object posting system can perform this posting. A more detailed block diagram of the process performed via the application analyst's classification module 302 is shown in FIGS. 13 and 20.

Once the application analyst classification module 302 has posted the application and its associated category or categories into the master application database 300, the upload/download module 301 thereafter routinely copies the master application database 300 to the application server module(s) 102. As can be imagined, the system can include thousands of application server modules 102, each of which is updated regularly by the upload/download module 301 to provide an updated database of categorized applications. Moreover, the upload/download module 301 can transfer portions of the master application database 300, such as updates, to the application server module 102 so that the entire database does not need to be transmitted. A flowchart of the process performed by the application database factory 110 is shown in, and will be described with reference to, FIGS. 11 and 18.

In some embodiments, the application analyst classification module 302 can process the categorized applications selected from the master application database 300 for their subsequent download to the application server module 102.

Referring now to FIGS. 5 and 6, a screen shot of one embodiment of a graphical user interface for the application analyst's classification module 302 is shown. In FIG. 6, the highlighted application filename is “cmdide.sys.” The name of the application is “CMD PCI IDE Bus Driver.” In this example, additional information uploaded to the application database factory 110 includes the publisher CMD Technology, Inc. and the related suite, Microsoft Windows Operating System. The application analyst's classification module 302 displays this information to the human reviewer to aid in categorizing the application.

As shown in FIG. 6, the application, CMD PCI IDE bus driver, was associated with the URL “http://www.microsoft.com//ddk/ifskit/links.asp”. In this example, the application analyst's classification module 302 classified the application in the parent group titled access/privacy. The application analyst classification module 302 can perform further categorization of the application. For example, in the parent group titled access/privacy, the application could be classified under anti-virus software, authentication, encryption, firewalls, hacking, remote access, spyware, or system audit. One or more risk classes can be used to group categories. The risk classes can be useful to the network administrator when associating rules/policies with each application. As mentioned above, one or more categories can be associated with a single application or hash value.

FIG. 7 is a flow diagram illustrating the process of monitoring and controlling the execution of a requested application on the workstation 101. The process begins at a start state 700. Next, at a state 702, the user of the workstation 101 launches an application. The launch of the application can be in response to a predetermined startup sequence for the workstation 101. For example, the workstation 101 could be programmed to launch one or more applications upon power-on startup. The execution launch detection module 210 (see FIG. 2) detects the launch of the application. Next, at a state 704, the application digest generator 201 generates a digest of data relating to the launched application. The digested data can be in the form of collection data. The collection data can include, for example, the publisher, suite, one or more hashes, and source directory.

The process moves to a decision state 706 where the execution launch detection module 210 compares the application digest prepared by the application digest generator 201 to the hash/policy table 204. For example, a hash generated by the application digest generator 201 can be compared to hashes from the hash/policy table 204. In one embodiment, a plurality of different hashes is generated and compared to hashes from the hash/policy table 204. For example, an MD-5 hash and an SHA-1 hash could be generated for the requested application and compared to MD-5 hashes and SHA-1 hashes from the hash/policy table 204.
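The comparison at decision state 706 can be sketched with a standard hashing library as follows. The sketch assumes the hash/policy table 204 is a mapping from hexadecimal digest to policy and that the digests are computed over the raw bytes of the application file; both assumptions, and the function names, are illustrative rather than part of the described embodiments.

```python
import hashlib

def application_digests(data: bytes):
    """Compute an MD5 and a SHA-1 digest of the application's bytes."""
    return {
        "md5": hashlib.md5(data).hexdigest(),
        "sha1": hashlib.sha1(data).hexdigest(),
    }

def lookup_policy(digests, hash_policy_table):
    """Return the policy for the first digest found in the table, else None."""
    for digest in digests.values():
        if digest in hash_policy_table:
            return hash_policy_table[digest]
    return None

app_bytes = b"example application bytes"  # stand-in for the file contents
table = {application_digests(app_bytes)["sha1"]: "allow"}
policy = lookup_policy(application_digests(app_bytes), table)
```

A result of `None` in this sketch corresponds to the case in which the application digest does not match any hash classified in the table.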

If the hash corresponds to a hash stored in the hash/policy table 204, the process continues to a state 710 where the policy associated with the hash is applied in response to the launch of the requested application. For example, these policies can include allowing the execution of the application, denying execution of the application, alerting the user that the execution of the application may receive further scrutiny by the network administrator, or allowing the application to run for a certain amount of time. In the last case, at the end of the specified time, the execution launch detection module 210 does not permit the application to continue running on the workstation 101. Next, at a state 712, the execution launch detection module 210 logs the event to the logging database 206. In this way, a record is maintained of the applications that are allowed to execute on the workstation 101. The process then moves to a state 714 where the execution launch detection module 210 monitors the system in order to detect the launch of another application on the workstation 101.

The retrieved information from the hash/policy table 204 further includes a policy associated with the hash value. In one embodiment, category information, which corresponds to the hash value, is utilized in selecting the policy. For example, a hash value could be associated with a parent group and/or category. The parent group and/or category could then be associated with the policy.

Returning to the decision state 706, if the application digest does not correspond with an application or hash classified in the hash/policy table 204, flow moves to a state 716 where the execution launch detection module 210 applies a not-classified application policy to the request to execute the application. The not-classified application policy can include, for example, allowing the application to execute, denying execution, or alerting the user that the request to execute the application will receive additional scrutiny, while limiting the amount of time that the application is allowed to run on the workstation 101.

Flow moves to a state 718 where the request to execute the application is logged to the logging database 206. The process continues to state 714 as described above where the execution launch detection module 210 awaits the launch of another application on the workstation 101.
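The flow of FIG. 7 from decision state 706 through the logging states can be summarized in the following sketch. The policy names, the choice of "alert" as the not-classified application policy, and the use of a list in place of the logging database 206 are assumptions made for illustration.

```python
# Hypothetical not-classified application policy.
NOT_CLASSIFIED_POLICY = "alert"

def handle_launch(app_hash, hash_policy_table, logging_database):
    """Apply a policy to a launched application and log the event.

    Returns True when the application is permitted to run.
    """
    # Decision state 706: is the hash classified in the hash/policy table?
    policy = hash_policy_table.get(app_hash)
    classified = policy is not None
    if not classified:
        # State 716: fall back to the not-classified application policy.
        policy = NOT_CLASSIFIED_POLICY
    # States 712 and 718: record the event in the logging database.
    logging_database.append(
        {"hash": app_hash, "policy": policy, "classified": classified})
    # In this sketch only the "deny" policy blocks execution.
    return policy != "deny"

table = {"a1": "deny", "b2": "allow"}
log = []
results = [handle_launch(h, table, log) for h in ("b2", "a1", "zz")]
```

After each call the caller would return to monitoring for the next launch, corresponding to state 714. A time-limited policy would additionally need a timer, which is omitted here.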

FIG. 8 is a flow diagram illustrating a process performed by the workstation 101 for uploading and downloading collection data with the application server module 102. The process begins at a start state 800. Next, at a state 802, the upload/download module 203 receives an incoming signal from the workstation upload/download module 104. The process proceeds to a decision state 804 where the upload/download module 203 receives a request to download the hash/policy table 204 from the application server module 102. The time for receiving the download file can be periodic, random, at a set time, or in response to polling. The upload/download module 203 and/or the workstation upload/download module 104 can initiate the download to the workstation management module 200.

If it is determined in state 804 that the upload/download module 203 is receiving a request to download from the application server module 102, the process moves to a state 806 where the upload/download module 203 receives and stores the hash/policy table 204 or a portion thereof.

For example, the application server module 102 can select data from the application inventory database 103 and policies from the policy database 109 for copying to the hash/policy table 204. The application inventory database 103 can include applications that have been categorized by the application database factory 110 as well as applications that have been categorized via the classification user interface 106. In some embodiments, the workstation upload/download module 104 transfers a portion of the hash/policy table 204. For example, the upload/download module 203 can receive an update so that the entire database need not be transmitted. In other embodiments, the upload/download module 203 receives a subset of the data from the application inventory database 103. For example, the selected data could be the hash values which are combined with the policies.

The downloaded data can update the existing hash/policy table 204. The downloaded data can be in the form of collection data from one or more sources. The sources can include the classification user interface 106 and the application database factory 110. As explained above, the collection data can include any additional data associated with the applications, for example, request frequencies associated with the applications from the application inventory database and/or request frequencies associated with the applications from the uncategorized application database 108, and/or indicators. The process moves to a state 810 where the upload/download module 203 awaits a wake-up signal from the application server module 102.

Returning to the decision state 804, if the upload/download module 203 is not requesting a download from the application server module 102, the process moves to a decision state 812 where the application server module 102 can request an inventory of the applications on the workstation 101. If the application server module 102 requests an inventory of the applications on the workstation 101, the process moves to a state 814 where the client inventory module 202 inventories the applications on the workstation 101. Once the client inventory module 202 compiles a list of the applications on the workstation 101, the process moves to a state 815 where the application digest generator 201 generates a digest of data relating to each application. The application digest generator 201 parses properties from the applications. Examples of such properties include the name, publisher, suite, hash, and version, which are associated with the applications.
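As a minimal sketch of the digest generated at state 815, the following Python fragment hashes an application's bytes and bundles the hash with the parsed properties. The `ApplicationDigest` record, the `make_digest` function, the sample values, and the choice of SHA-1 at this step are illustrative assumptions; the description does not prescribe a data layout.

```python
import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class ApplicationDigest:
    """Hypothetical record holding the properties parsed from one application."""
    name: str
    publisher: str
    suite: str
    version: str
    hash_value: str


def make_digest(name, publisher, suite, version, content: bytes) -> ApplicationDigest:
    """Hash the application bytes and attach the parsed properties."""
    return ApplicationDigest(name, publisher, suite, version,
                             hashlib.sha1(content).hexdigest())


# Illustrative values only; a real digest would be computed from the file on disk.
digest = make_digest("editor.exe", "Example Corp", "Office Suite", "1.0",
                     b"binary image")
```

Because the record is derived only from the application's content and properties, the same application inventoried on two workstations yields the same digest.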

The process then moves to a state 824 where the application and the digest are stored in the logging database 206. The process then moves to decision state 820 where the client inventory module 202 determines whether all of the inventoried applications have been stored in the logging database 206. If all of the inventoried applications have not been processed, flow returns to state 824 where the next application inventoried by the client inventory module 202 is processed as described above.

Returning to decision state 820, if all of the applications have been processed, the process moves to state 830 where the upload/download module 203 transmits the logging database 206 to the application server module 102. Next, the process moves to state 810 where the upload/download module 203 awaits a wake-up signal from the application server module 102.

Returning to decision state 812, if an inventory is not requested by the application server module 102, the process moves to a decision state 826 to determine whether the application server module 102 is only requesting collection data from the logging database 206 for uncategorized applications. If the application server module 102 only requests data for uncategorized applications, the process moves to a state 828 wherein the upload/download module 203 extracts and formats data associated with the uncategorized applications from the logging database 206 for uploading to the application server module 102. The process next moves to a state 830 where the data associated with the uncategorized applications is transmitted to the application server module 102. The collection data uploaded to the application server module 102 can be formatted or unformatted. Additionally, the collection data can be encrypted and/or compressed or not. The workstation upload/download module 104 decrypts and uncompresses the collection data if decryption and/or uncompression is required. The workstation upload/download module 104 reassembles the collection data into a list of applications and any additional data associated with the applications. The workstation upload/download module 104 merges and sorts the collection data.

Next, the process moves to the state 810 where the workstation management module 200 awaits the next wake-up signal from the application server module 102.

Returning to the decision state 826, if the application server module 102 is not requesting only the collection data for the uncategorized applications from the logging database 206, the process moves to a state 832 where the upload/download module 203 extracts and formats all of the application data in the logging database 206. This data can include categorized data for applications that are listed in the hash/policy table 204 and uncategorized data for applications that are not listed in the hash/policy table 204. The collection data can be formatted or unformatted. Additionally, the collection data can be encrypted and/or compressed or not. Flow then proceeds to state 830 where the data from the logging database 206 is uploaded to the application server module 102. The flow then proceeds as described above to state 810 where the workstation management module 200 awaits a wake-up signal from the application server module 102.
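The choice made at decision state 826 between uploading only uncategorized data (state 828) and uploading all logged data (state 832) can be sketched as follows. The function name `select_upload` and the dictionary-based record layout are hypothetical; the sketch assumes that an application is uncategorized when its hash is absent from the hash/policy table.

```python
def select_upload(logging_db, hash_policy_table, uncategorized_only):
    """Return the log records to upload to the application server module."""
    if uncategorized_only:
        # State 828: keep only records whose hash is not in the table.
        return [rec for rec in logging_db
                if rec["hash"] not in hash_policy_table]
    # State 832: upload categorized and uncategorized records alike.
    return list(logging_db)


# Illustrative data only.
log = [{"hash": "a1b2", "name": "game.exe"},
       {"hash": "ffff", "name": "unknown.exe"}]
table = {"a1b2": "deny"}
uncategorized = select_upload(log, table, uncategorized_only=True)
everything = select_upload(log, table, uncategorized_only=False)
```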

FIG. 9 is a flow diagram illustrating a process performed by the application server module 102 for uploading and downloading collection data with the workstation 101. The process begins at a start state 900. Next, at a decision state 902, the workstation upload/download module 104 determines whether to generate a download to the workstation management module 200. The time for receiving the download can be periodic, random, at a set time, or in response to polling. The workstation upload/download module 104 and/or the upload/download module 203 can initiate the download to the workstation management module 200. If the workstation upload/download module 104 is to download to the workstation management module 200, the process moves to a state 904 where the workstation upload/download module 104 extracts policy data from the policy database 109. The policy database 109 associates access permissions to the parent groups and/or categories associated with each application based on the workstation receiving the download. For example, if a workstation were not designated to run applications relating to games, the policy database 109 would identify the parent groups and/or categories which are associated with games for that workstation. The network administrator, via the classification user interface 106, can update the policy database 109. The policy database 109 can include different access privileges for each workstation 101. In this way, different workstations 101 can have different policies associated with the applications running thereon.

The process moves to a state 906 where the workstation upload/download module 104 creates a hash/policy table from the application inventory database 103 in conjunction with the designated policies for this workstation. Each parent group and/or category is associated with the policies extracted from the policy database 109 for each of the one or more workstations receiving a download. Each application or hash in the application inventory database 103 can be associated with a parent group and/or category. Continuing with the example above, the workstation upload/download module 104 selects the hash values from the application inventory database 103 for applications that are associated with the parent groups and/or categories relating to games. Thus, the same application may be allowed to run on one workstation but not allowed to run on a different workstation. Flow continues to a state 908 where the workstation upload/download module 104 transmits the hash/policy table 204 or a portion thereof to the upload/download module 203. The download file can include the application names, hash values, associated categories, and/or associated policies. Flow then proceeds to end state 910.
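One way to picture state 906 is as a join between the categorized hashes and the per-workstation policies. The sketch below is an assumption-laden illustration: `build_hash_policy_table`, the dictionary layouts, and the policy strings "allow" and "deny" are hypothetical stand-ins for the application inventory database 103 and the policy database 109.

```python
def build_hash_policy_table(inventory, workstation_policies):
    """Map each hash to its category and the policy set for one workstation.

    inventory: {hash: category}, standing in for the inventory database.
    workstation_policies: {category: policy} for the receiving workstation.
    """
    table = {}
    for app_hash, category in inventory.items():
        if category in workstation_policies:
            table[app_hash] = {"category": category,
                               "policy": workstation_policies[category]}
    return table


# Illustrative data: the same game is denied on one workstation, allowed on another.
inventory = {"a1b2": "games", "c3d4": "word processing"}
table_ws1 = build_hash_policy_table(
    inventory, {"games": "deny", "word processing": "allow"})
table_ws2 = build_hash_policy_table(
    inventory, {"games": "allow", "word processing": "allow"})
```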

Returning to decision state 902, if the workstation upload/download module 104 is not generating a download for the workstation 101, the process moves to a decision state 912 where the workstation upload/download module 104 determines whether to request an upload of the workstation inventory. The workstation inventory can include all, or a portion of, the logging database 206.

If the workstation upload/download module 104 requests an upload from the workstation 101, the process moves to a state 914 where a request is sent by the application server module 102 to the upload/download module 203. Next, at a state 916, the workstation upload/download module 104 receives the requested upload from the workstation 101. The uploaded data can be formatted or unformatted. Additionally, the uploaded data can be encrypted and/or compressed or not. The workstation upload/download module 104 decrypts and uncompresses the uploaded data if decryption and/or uncompression is required at next state 918.

Flow continues to state 920 where the workstation upload/download module 104 reassembles the uploaded data into a list of applications and any additional data associated with the applications. The workstation upload/download module 104 merges and sorts the collected data including the frequency count with other workstation inventories. The system can include thousands of workstation management modules, each of which is regularly uploading data from its logging database 206. As explained above, the uploaded data can include any additional data associated with the application, for example, directory location. The workstation upload/download module 104 can merge and sort the uploaded data based on the application or any additional data associated with the application. For example, the workstation upload/download module 104 can refer to a request frequency to sort and merge the applications from one or more workstations 101.
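The merge and sort by request frequency can be sketched with a counter keyed on the application hash. The function name `merge_inventories` and the `{hash: request_count}` upload format are assumptions made for illustration.

```python
from collections import Counter


def merge_inventories(uploads):
    """Sum request counts per hash across workstations, most requested first."""
    totals = Counter()
    for upload in uploads:
        totals.update(upload)  # adds the counts from one workstation's upload
    return totals.most_common()


# Illustrative uploads from two workstations.
merged = merge_inventories([{"aaa": 3, "bbb": 1}, {"bbb": 5, "ccc": 2}])
```

Sorting by total frequency places the most widely requested applications at the head of the list.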

FIG. 10 is a flow diagram illustrating the process of categorizing the applications at the application server module 102. The process begins at a start state 1000. Next, at a state 1002, a network administrator launches the classification user interface 106 via the GUI. The GUI provides a graphical interface tool for the network administrator to manipulate and manage the application inventory database 103. The network administrator extracts a list of applications and/or associated data from the uncategorized application database 108 for review and categorization. The process moves to a state 1004 where the application and any related data is displayed for review by the network administrator. Next, at a state 1006, the network administrator classifies the application based on the displayed data. The process then moves to a state 1008 where the process returns to states 1004 and 1006 for each application extracted from the uncategorized application database 108.

FIG. 11 is a flow diagram illustrating the process of downloading the master application database 300 to the application server module 102 and for uploading inventoried application data from the application server module 102. The process begins at a start state 1100. Next, at a state 1102, the factory upload/download module 105 requests a download of the categorized applications from the application database factory 110. The categorized applications are stored in the master application database 300 at the application database factory 110. The time for receiving the categorized applications can be periodic, random, at a set time, or in response to polling. The factory upload/download module 105 and/or the upload/download module 301 can initiate the download to the application server module 102. As explained above, the downloaded data can include any additional data associated with the application.

Flow continues to decision state 1104 where the factory upload/download module 105 (see FIG. 3) determines whether a send all uncategorized application flag has been activated. The send all uncategorized application flag can be selected by the network administrator via the classification user interface 106. If the send all uncategorized application flag has been activated, the process moves to a state 1106 where the factory upload/download module 105 retrieves all applications from the uncategorized application database 108. Flow continues to decision state 1108 where the factory upload/download module 105 determines if the send all application inventory flag has been activated. The send all application inventory flag can be activated by the network administrator via the classification user interface 106. If the send all application inventory flag has been activated, the process moves to a state 1110 where the factory upload/download module 105 retrieves the data from the application inventory database 103. Flow moves to a state 1112 where the uncategorized applications and any additional data associated with the applications, for example, collection data, can be formatted. The additional data can include request frequencies and/or indicators associated with the applications. The collection data is not required to be formatted and thus may be directly uploaded to the application database factory 110. Moreover, the selection of a format for the collection data can depend on the type of data connection that the application database factory 110 has with the application server module 102. For a data connection via the Internet 108, the factory upload/download module 105 can use a markup language, for example, extensible markup language (XML), standard generalized markup language (SGML), and hypertext markup language (HTML), to format the collection data.
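As an illustration of the optional formatting at state 1112, the sketch below serializes collection data as XML using the Python standard library. The element and attribute names (`collection`, `application`, `hash`, `name`, `requestFrequency`) are hypothetical; the description does not define a schema.

```python
import xml.etree.ElementTree as ET


def format_collection_data(records):
    """Serialize collection records as an XML string."""
    root = ET.Element("collection")
    for rec in records:
        app = ET.SubElement(root, "application", hash=rec["hash"])
        ET.SubElement(app, "name").text = rec["name"]
        ET.SubElement(app, "requestFrequency").text = str(rec["frequency"])
    return ET.tostring(root, encoding="unicode")


# Illustrative record only.
xml_text = format_collection_data(
    [{"hash": "a1b2", "name": "game.exe", "frequency": 7}])
```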

The collection data can be further processed prior to its upload to the application database factory 110. For example, check limit state 1114 and compression and encryption state 1116 can be performed to process the collection data prior to uploading to the application database factory 110. While these blocks may facilitate the upload of the collection data, they are not required to be performed. The collection data can be uploaded without applying states 1114 and 1116. In this way the process can follow alternate path 1113. Thus, the collection data can be directly uploaded to the application database factory 110 without applying states 1114 and 1116.

If further processing is desired, the process moves to a state 1114 where the factory upload/download module 105 can limit the collection data to a maximum size for uploading to the application database factory 110. For example, the collection data from a single workstation could be limited to a maximum of 20 megabytes. The process continues to a state 1116 where the collection data is compressed so that the collection data takes up less space. Further, the collection data is encrypted so that it is unreadable except by authorized users, for example, the application database factory 110.
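States 1114 and 1116 can be sketched as a size check followed by compression. The sketch assumes that the limit is enforced by dropping whole records once the budget is reached, which the description does not specify, and it omits the encryption step because the description names no cipher. `prepare_upload` and `MAX_UPLOAD_BYTES` are hypothetical names.

```python
import zlib

MAX_UPLOAD_BYTES = 20 * 1024 * 1024  # mirrors the 20-megabyte example


def prepare_upload(records, max_bytes=MAX_UPLOAD_BYTES):
    """Keep whole records until the size budget is used up, then compress."""
    kept, size = [], 0
    for rec in records:
        if size + len(rec) > max_bytes:
            break  # assumption: later records are left for a future upload
        kept.append(rec)
        size += len(rec)
    return zlib.compress(b"\n".join(kept))


# Illustrative call with a tiny limit so that the third record is dropped.
payload = prepare_upload([b"aaaa", b"bbbb", b"cccc"], max_bytes=8)
```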

Flow continues to a state 1118 where the collection data is uploaded to the application database factory 110. As explained above, the collection data can include any additional data associated with the application, for example, suite information. The process moves to a state 1120 where the upload/download module 301 continues with the download to the factory upload/download module 105. The process moves to a state 1122 where the downloaded data is stored in the application inventory database 103.

Returning to decision state 1108, if the send all application inventory flag is not activated, flow moves to state 1112 as described above. Since the send all application inventory flag was not activated, the factory upload/download module 105 formats the data retrieved at state 1106 for its upload to the application database factory 110 as described with reference to states 1112, 1114, 1116 and 1118.

Returning to decision state 1104, if the send all uncategorized application flag was not activated, the process moves to decision state 1108 as described above where the factory upload/download module 105 determines if the send all application inventory flag has been activated. Depending on whether the send all application inventory flag was activated, the process then continues as described above.
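The two decision states 1104 and 1108 act as independent switches on what is gathered for upload. A minimal sketch, with the hypothetical function `gather_upload` and plain lists standing in for the uncategorized application database 108 and the application inventory database 103:

```python
def gather_upload(send_all_uncategorized, send_all_inventory,
                  uncategorized_db, inventory_db):
    """Collect the data selected by the two administrator flags."""
    data = []
    if send_all_uncategorized:      # decision state 1104 -> state 1106
        data.extend(uncategorized_db)
    if send_all_inventory:          # decision state 1108 -> state 1110
        data.extend(inventory_db)
    return data


# Illustrative databases only.
both = gather_upload(True, True, ["u1", "u2"], ["i1"])
inventory_only = gather_upload(False, True, ["u1", "u2"], ["i1"])
```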

FIG. 12 is a flow diagram illustrating a process for collecting data by the application database factory 110. The process begins at a state 1200. Next, at a decision state 1202, the application database factory 110 can download the master application database 300 to the application server module 102. If the application database factory 110 is to download the master application database 300 to the application server module 102, the process moves to a state 1204 where the upload/download module 301 extracts categorized applications from the master application database 300. A subset of the categorized applications can be selected for download to the application server module 102. The subset can include only categorized applications that have been deemed ready for publishing.

The process moves to a state 1206 where the application data retrieved from the master application database 300 can be formatted. The application data is not required to be formatted and thus may be directly downloaded to the application server module 102. Moreover, the selection of a format for the data can depend on the type of data connection that the application database factory 110 has with the application server module 102. For a data connection via the Internet 108, the upload/download module 301 can use a markup language, for example, XML, SGML and HTML, to format the collection data.

The data to be downloaded can be further processed prior to its download to the application server module 102. The process continues to a state 1208 where the application data is compressed so that the application data takes up less space. Further, the application data is encrypted so that it is unreadable except by authorized users, for example, the application server module 102. Flow continues to a state 1210 where the application data is downloaded to the application server module 102. The process then moves to state 1212 which is an end state.

Returning to decision state 1202, if application data from the master application database 300 is not being downloaded to the application server module 102, the process moves to a decision state 1214 where the application database factory 110 can receive an upload from the application server module 102. If the application database factory 110 is not to receive an upload from the application server module 102, the process moves to end state 1212.

Returning to decision state 1214, if the application database factory 110 is to receive an upload from the application server module 102, the process moves to a state 1216 where the upload/download module 301 receives the upload from the factory upload/download module 105. The collection data may be received on a periodic basis, randomly, at a set time, or in response to polling. The upload/download module 301 and/or the factory upload/download module 105 can initiate the upload to the application database factory 110. As explained above, the collection data can include any additional data associated with the application, for example, request frequencies associated with the application from the application inventory database 103 and/or request frequencies associated with applications from the uncategorized application database 108. The collection data can be formatted or unformatted. Additionally, the collection data can be encrypted and/or compressed or not.

The process continues to a state 1218 where the upload/download module 301 decrypts and uncompresses the collection data if decryption and/or uncompression is required. The process moves to a state 1220 where the collection data is merged and sorted into the master application database 300 and the uncategorized application database 303. The process then continues to end state 1212.

FIG. 13 is a flowchart illustrating the process of classifying applications from the uncategorized application database 303. The process begins at start state 1300. The process moves to a state 1302 where a list of applications is extracted from the uncategorized application database 303 for classification by the human reviewer via the application analyst's classification module 302. The application analyst classification module 302 interfaces with the human reviewer to determine the appropriate category or categories of the application. Next, at a state 1304, the application analyst's classification module 302 is utilized to display the application and any related data on the GUI. The related data can indicate to the human reviewer the category or categories with which the application should be associated. As explained above, the application analyst classification module 302 allows the human reviewer to analyze each application and any additional data that is associated with the application to determine its appropriate category or categories.

The process continues to a state 1306 where the human reviewer uses the application, related information, and any Internet information to research the application. The Internet information can be derived from a search using a web browser search engine. The application name and any of the related application data can be used for the Internet search. The human reviewer can further review documents, specifications, manuals, and the like to best determine the category or categories to associate with the application. The process continues to a state 1308 where the human reviewer classifies each application using the evidence associated with the application, any hints from the related information, and/or other research.

The process finally moves to a state 1310 where the selected category or categories that the human reviewer associated with the given application are stored in the master application database 300.

FIGS. 14-20 describe processes for monitoring the network behavior of an application. While the processes described with reference to FIGS. 7 through 13 were directed to controlling applications when the applications are launched on the workstation, the processes described with reference to FIGS. 14-20 are directed to controlling the operation of the application after the application is initially launched. For example, the execution launch detection module 210 initially evaluates a launched application and allows the application to run on the workstation based on the policy associated with the category or group of the application. Standing alone, the execution launch detection module 210 controls what applications are allowed to operate on any given workstation.

However, the subsequent operation of the application is monitored by the network access detection module 208. Thus, even though an application is allowed to launch on a given workstation, the network access detection module 208 may curtail or limit the application if the application does not operate in a predetermined manner. Further, the network access detection module 208 can continually or periodically monitor the running application to ensure that the application continues to operate in the predetermined manner.

FIG. 14 is a flow diagram illustrating a process for monitoring the behavior of an application. In addition to monitoring behavior, the process can curtail or control the behavior of the application. The process monitors applications that request access to a network upon launch as well as applications that request access to the network after launch. Thus, applications that the execution launch detection module 210 allows to run on the workstation 101 may or may not be allowed to access the network.

The process begins at a start state 1400. Next, at a state 1402, an application requests access to a network. The request to access the network can be in response to a predetermined startup sequence for the workstation 101. For example, the workstation 101 could be programmed to access one or more networks upon power-on startup. Upon launch or after launch, an application may request access to a publisher's website to download software updates. This request for access may be in response to a user input or the application itself.

The network access detection module 208 (see FIG. 2) detects the network access of the application. Next, at a state 1404, the application digest generator 201 generates a digest of data relating to the application. The digested data can be in the form of collection data. The collection data can include, for example, source IP address, destination IP address, source port number, destination port number and other network access data.

The process moves to a decision state 1406 where the network access detection module 208 compares the application digest and collection data prepared by the application digest generator 201 to the hash/policy table 204. For example, a hash generated by the application digest generator 201 and collection data can be compared to hashes from the hash/policy table 204 and network access data associated with the hash. In one embodiment, a plurality of different hashes is generated and compared to hashes from the hash/policy table 204. For example, an MD-5 hash and an SHA-1 hash could be generated for the requested application and compared to MD-5 hashes and SHA-1 hashes from the hash/policy table 204. In this way, the behavior of the application is monitored by the network access detection module 208.
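The multi-hash comparison at decision state 1406 can be sketched as follows, assuming each table entry stores both an MD-5 and an SHA-1 value and that an application matches only when both agree. The names `digests` and `lookup` and the entry layout are hypothetical.

```python
import hashlib


def digests(content: bytes) -> dict:
    """Compute the MD-5 and SHA-1 hex digests of an application image."""
    return {"md5": hashlib.md5(content).hexdigest(),
            "sha1": hashlib.sha1(content).hexdigest()}


def lookup(content: bytes, table):
    """Return the first table entry whose hashes both match, else None."""
    d = digests(content)
    for entry in table:
        if entry["md5"] == d["md5"] and entry["sha1"] == d["sha1"]:
            return entry
    return None


# Illustrative table with a single known application.
known = dict(digests(b"known application"), policy="allow")
match = lookup(b"known application", [known])
miss = lookup(b"tampered application", [known])
```

Requiring agreement on more than one hash makes an accidental or engineered collision on a single algorithm insufficient to match a table entry.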

If the hash and collection data correspond to a hash stored in the hash/policy table 204 and to the collection data associated with that hash in the hash/policy table 204, the process continues to a state 1410 where the policy associated with the hash is applied in response to the network access request. In this case, the behavior or network attributes of the application match an expected behavior for the application. These policies can include allowing the application to access the network, denying access to the network, alerting the user that access to the network may receive further scrutiny by the network administrator, or allowing access to the network for a certain amount of time. For example, at the end of a specified time the network access detection module 208 does not permit the application to continue accessing the network. Next, at a state 1412, the network access detection module 208 logs the network access data for the classified application to the logging database 206. In this way, a record is maintained of the applications that are allowed to access the network. The process then moves to a state 1414 where the network access detection module 208 monitors the system in order to detect the next network access by the same or another application on the workstation 101.

The retrieved information from the hash/policy table 204 further includes a policy associated with the hash value. In one embodiment, category and/or parent group information that corresponds to the hash value is utilized in selecting the policy. For example, a hash value could be associated with a specific parent group. The parent groups could include “productivity,” “communication,” “expected or predetermined network access,” and “unexpected network access.” The parent groups “expected or predetermined network access” and “unexpected network access” may be sub-groups or categories within another group. For example, a hash for a word processing application is associated with the group “productivity,” and sub-categories “word processing” and “expected or predetermined network access.” The sub-category “expected or predetermined network access” could then be associated with a policy that allows the access to the network. The sub-category “unexpected network access” could then be associated with a policy that does not allow access to the network or curtails or limits access to the network.
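The selection of a policy from category information can be sketched as a lookup keyed on the sub-category. The mapping `CATEGORY_POLICIES`, the function `select_policy`, the policy strings, and the fallback of "deny" when no listed category applies are all assumptions made for illustration.

```python
CATEGORY_POLICIES = {
    "expected or predetermined network access": "allow",
    "unexpected network access": "deny",
}


def select_policy(categories, default="deny"):
    """Return the policy of the first category that has one assigned."""
    for category in categories:
        if category in CATEGORY_POLICIES:
            return CATEGORY_POLICIES[category]
    return default  # assumption: fall back when no category carries a policy


# The word processing example from the description.
word_processor = ["productivity", "word processing",
                  "expected or predetermined network access"]
policy = select_policy(word_processor)
```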

Returning to the decision state 1406, if the application digest and collection data do not correspond with an application or hash classified in the hash/policy table 204, flow moves to a state 1416 where the network access detection module 208 applies a not-classified application policy to the request to access the network. The not-classified application policy can include, for example, allowing the application to access the network, denying access, or alerting the user that additional scrutiny will be applied to the network access, while limiting the amount of time that the application is allowed to access the network.
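The two branches out of decision state 1406 can be sketched together. The sketch assumes that the network access data reduces to a destination port and that each table entry lists the ports expected for its hash; `policy_for_request`, `NOT_CLASSIFIED_POLICY`, and the table layout are hypothetical.

```python
NOT_CLASSIFIED_POLICY = "deny"  # assumption: one of the options named above


def policy_for_request(app_hash, dest_port, table):
    """Apply the entry's policy on a full match, else the fallback policy."""
    entry = table.get(app_hash)
    if entry is not None and dest_port in entry["expected_ports"]:
        return entry["policy"]        # state 1410: classified, expected access
    return NOT_CLASSIFIED_POLICY      # state 1416: hash or behavior unmatched


# Illustrative table: one application expected to use ports 80 and 443.
table = {"a1b2": {"expected_ports": {80, 443}, "policy": "allow"}}
expected = policy_for_request("a1b2", 443, table)
unexpected = policy_for_request("a1b2", 6667, table)
unknown = policy_for_request("ffff", 80, table)
```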

Flow moves to a state 1418 where the network access data for the not-classified application is logged to the logging database 206. The process continues to state 1414 as described above where the network access detection module 208 awaits a request to access the network from the same application or different application.

FIG. 15 is a flow diagram illustrating a process performed by the workstation for uploading and downloading collection data related to the network accessing applications with the application server module 102. The process begins at a start state 1500. Next, at a state 1502, the upload/download module 203 receives an incoming signal from the workstation upload/download module 104. The process proceeds to a decision state 1504 where the upload/download module 203 receives a request to download the hash/policy table 204 from the application server module 102. The time for receiving the download file can be periodic, random, at a set time, or in response to polling. The upload/download module 203 and/or the workstation upload/download module 104 can initiate the download to the workstation management module 200.

If it is determined in state 1504 that the upload/download module 203 is receiving a request to download from the application server module 102, the process moves to a state 1506 where the upload/download module 203 receives and stores the hash/policy table 204 or a portion thereof. The hash/policy table 204 can include collection data in the form of network access data.

For example, the application server module 102 can select data from the application inventory database 103 and policies from the policy database 109 for copying to the hash/policy table 204. The application inventory database 103 can include applications that have been categorized by the application database factory 110. The application can be categorized via the classification user interface 106.

A categorized application is an application that is associated with collection data. The collection data can include network access data. In some embodiments, the workstation upload/download module 104 transfers a portion of the hash/policy table 204. For example, the upload/download module 203 can receive an update so that the entire database need not be transmitted. In other embodiments, the upload/download module 203 receives a subset of the data from the application inventory database 103. For example, the selected data could be the hash values which are combined with the policies.

The downloaded data can update the existing hash/policy table 204. The downloaded data can be in the form of collection data from one or more sources. The sources can include the classification user interface 106 and the application database factory 110. As explained above, the collection data can include any additional data associated with the applications, for example, request frequencies associated with the applications from the application inventory database and/or request frequencies associated with the applications from the uncategorized application database 108, and/or indicators. As explained above, the application inventory database 103 can include the uncategorized application database 108 and the network access database 107. Alternatively, the uncategorized application database 108 and/or the network access database 107 are combined into a single database or can be separate databases from the application inventory database 103. The uncategorized application database 108 can include applications which are not classified or categorized along with applications that are not operating in an expected or predetermined manner. The network access data associated with the application may be stored in the network access database 107, the uncategorized application database 108, and/or the application inventory database 103.

The process moves to a state 1510 where the upload/download module 203 awaits a wake-up signal from the application server module 102.

Returning to the decision state 1504, if the upload/download module 203 is not requesting a download from the application server module 102, the process moves to a decision state 1525 to determine whether the application server module 102 is requesting collection data from the logging database 206. If the application server module 102 is not requesting logging data, the process moves to state 1510 as described above. Returning to decision state 1525, if the application server module 102 is requesting logging data, the process moves to decision state 1526 to determine whether the application server module 102 is only requesting collection data from the logging database 206 for uncategorized applications. The uncategorized applications can include applications that were previously categorized but are not operating in a predetermined or expected manner.

If the application server module 102 only requests data for uncategorized applications, the process moves to a state 1528 wherein the upload/download module 203 extracts and formats data associated with the uncategorized applications from the logging database 206 for uploading to the application server module 102. The process next moves to a state 1530 where the data is transmitted to the application server module 102. The collection data uploaded to the application server module 102 can be formatted or unformatted. Additionally, the collection data can be encrypted and/or compressed or not. The workstation upload/download module 104 decrypts and uncompresses the collection data if decryption and/or uncompression is required. The workstation upload/download module 104 reassembles the collection data into a list of applications and any additional data associated with the applications. The workstation upload/download module 104 merges and sorts the collection data.

Next, the process moves to the state 1510 where the workstation management module 200 awaits the next wake-up signal from the application server module 102.

Returning to the decision state 1526, if the application server module 102 is requesting more than the collection data for the uncategorized applications from the logging database 206, the process moves to a state 1532 where the upload/download module 203 extracts and formats all of the application data in the logging database 206. This data can include categorized data for applications that are listed in the hash/policy table 204 and uncategorized data for applications that are not listed in the hash/policy table 204. The collection data can be formatted or unformatted. Additionally, the collection data can be encrypted and/or compressed or not. Flow then proceeds to state 1530 where the data is uploaded to the application server module 102. The flow then proceeds as described above to state 1510 where the workstation management module 200 awaits a wake-up signal from the application server module 102.
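The branch taken at decision states 1525 and 1526 can be illustrated with a minimal sketch. It assumes the logging database 206 is a simple list of records and that the hash/policy table 204 is keyed by application hash. The names `LogRecord` and `select_upload` and the request values are hypothetical and do not appear in the specification. For simplicity, the sketch treats an application as uncategorized only when its hash is missing from the table, and it ignores previously categorized applications that behave unexpectedly.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class LogRecord:
    """Hypothetical row of the logging database 206."""
    app_hash: str
    app_name: str
    destination_port: int


def select_upload(logging_db: List[LogRecord],
                  hash_policy_table: Dict[str, str],
                  request: Optional[str]) -> List[LogRecord]:
    """Return the records to upload for a given server request.

    request is None (nothing requested, proceed to state 1510),
    "uncategorized" (state 1528), or "all" (state 1532).
    """
    if request is None:
        return []
    if request == "uncategorized":
        # Keep only applications that are not listed in the hash/policy table.
        return [r for r in logging_db if r.app_hash not in hash_policy_table]
    if request == "all":
        return list(logging_db)
    raise ValueError(f"unknown request type: {request!r}")
```

In either upload branch the selected records would then be passed on to state 1530 for transmission.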

FIG. 16 is a flow diagram illustrating a process performed by the application server module for uploading and downloading collection data for network accessing applications with the workstation 101. The process begins at a start state 1600. Next, at a decision state 1602, the workstation upload/download module 104 determines whether to generate a download to the workstation management module 200. The time for receiving the download can be periodic, random, at a set time, or in response to polling. The workstation upload/download module 104 and/or the upload/download module 203 can initiate the download to the workstation management module 200. If the workstation upload/download module 104 is to download to the workstation management module 200, the process moves to a state 1604 where the workstation upload/download module 104 extracts network access data from the network access database 107. The network access data is compiled into a policy database 109. The policy database 109 associates access permissions to each application based on which workstation receives the download. The policy database 109 can include different access privileges for each workstation 101. In this way, different workstations 101 can have different policies associated with the same application running thereon.

The process moves to a state 1606 where the workstation upload/download module 104 creates a network access policy table in conjunction with the designated policies for this workstation. Thus, the same application may be allowed to access a website from a workstation but not allowed to access the same website from a different workstation.

Flow continues to a state 1608 where the workstation upload/download module 104 transmits the network access policy table or a portion thereof to the upload/download module 203. The download file can include the application names, hash values, associated categories, and/or associated policies. Flow then proceeds to end state 1610.
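States 1604 through 1608 can be sketched as follows, assuming the policy database 109 can be modeled as a mapping from a (workstation, application hash) pair to a policy. The function name `build_policy_table`, the dictionary field names, and the "deny" default are illustrative assumptions rather than details taken from the specification.

```python
from typing import Dict, List, Tuple

# Hypothetical model of the policy database 109:
# (workstation id, application hash) -> policy name.
PolicyDb = Dict[Tuple[str, str], str]


def build_policy_table(workstation_id: str,
                       network_access_db: List[dict],
                       policy_db: PolicyDb,
                       default_policy: str = "deny") -> List[dict]:
    """Create the network access policy table for one workstation."""
    table = []
    for entry in network_access_db:
        # The same application can resolve to a different policy per workstation.
        policy = policy_db.get((workstation_id, entry["hash"]), default_policy)
        table.append({
            "name": entry["name"],
            "hash": entry["hash"],
            "category": entry["category"],
            "policy": policy,
        })
    return table
```

With this shape, two workstations running the same application receive tables that differ only in the policy column.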

Returning to decision state 1602, if the workstation upload/download module 104 is not generating a download for the workstation 101, the process moves to a decision state 1612 where the workstation upload/download module 104 determines whether to request an upload of the hash and network access data. The hash and network access data can include all, or a portion of, the logging database 206.

If the workstation upload/download module 104 requests an upload from the workstation 101, the process moves to a state 1614 where a request for all or only uncategorized data is sent by the application server module 102 to the upload/download module 203. Next, at a state 1616, the workstation upload/download module 104 receives the requested upload from the workstation 101. The uploaded data can be formatted or unformatted. Additionally, the uploaded data can be encrypted and/or compressed or not. The workstation upload/download module 104 decrypts and uncompresses the uploaded data if decryption and/or uncompression is required at next state 1618.

Flow continues to state 1620 where the workstation upload/download module 104 reassembles the uploaded data into a list of applications and any additional data associated with the network access. The workstation upload/download module 104 merges and sorts the collected data including the frequency count with other workstation inventories. The system can include thousands of workstation management modules, each of which is regularly uploading data from its logging database 206. As explained above, the uploaded data can include any additional data associated with the network access, for example, the source IP address, destination IP address, source port number, destination port number and other network access data. The workstation upload/download module 104 can merge and sort the uploaded data based on the application or any additional data associated with the request for network access. For example, the workstation upload/download module 104 can refer to a destination IP address to sort and merge the applications from one or more workstations 101.
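The merge and sort at state 1620 can be illustrated with a short sketch. It assumes each workstation upload has been reduced to a mapping from an application identifier to a request frequency count. The function name `merge_inventories` and the sort order (most frequent first) are assumptions made for illustration.

```python
from collections import Counter
from typing import Dict, Iterable, List, Tuple


def merge_inventories(uploads: Iterable[Dict[str, int]]) -> List[Tuple[str, int]]:
    """Merge per-workstation frequency counts and sort them.

    Each upload maps an application identifier to its request count.
    """
    totals: Counter = Counter()
    for inventory in uploads:
        totals.update(inventory)  # adds counts for identifiers already seen
    # Most frequently requested applications first, ties broken by identifier.
    return sorted(totals.items(), key=lambda item: (-item[1], item[0]))
```

The same approach could key the counts on a destination IP address instead of the application identifier.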

FIG. 17 is a flow diagram illustrating a process for analyzing network access data associated with an application's request to access the network at the application server module 102. The process begins at a start state 1700. Next, at a state 1702, a network administrator launches the classification user interface 106 via the GUI. The GUI provides a graphical interface tool for the network administrator to manipulate and manage the application inventory database 103. The network administrator extracts a list of applications and data from the network access database 107.

The process moves to a state 1704 where the application and any related data are displayed for review by the network administrator. Next, at a state 1706, the network administrator determines the allowed behavior for the application. The process then moves to a state 1708 where the process returns to states 1704 and 1706 for each application extracted from the network access database 107.

FIG. 18 is a flow diagram illustrating a process for uploading network access data from the application server module to the application database factory. The process begins at a start state 1800. Next, at a state 1802, the factory upload/download module 105 requests a download of the categorized applications from the application database factory 110. The categorized applications are stored in the master application database 300 at the application database factory 110. The time for receiving the categorized applications can be periodic, random, at a set time, or in response to polling. The factory upload/download module 105 and/or the upload/download module 301 can initiate the download to the application server module 102. As explained above, the downloaded data can include any additional data associated with the application. The additional data can include network access data.

Flow continues to decision state 1804 where the factory upload/download module 105 (see FIG. 3) determines whether a send all uncategorized applications network access data flag has been activated. The flag can be selected by the network administrator via the classification user interface 106. If the flag has been activated, the process moves to a state 1806 where the factory upload/download module 105 retrieves all uncategorized applications. Flow continues to decision state 1808 where the factory upload/download module 105 determines if the send all network access application inventory flag has been activated. The send all network access application inventory flag can be activated by the network administrator via the classification user interface 106. If the send all network access application inventory flag has been activated, the process moves to a state 1810 where the factory upload/download module 105 retrieves the data from the application inventory database 103. Flow moves to a state 1812 where the uncategorized applications and any additional data associated with the applications, for example, collection data, can be formatted. The additional data can include a source IP address, destination IP address, source port number, destination port number and other network access data associated with the applications. The collection data is not required to be formatted and thus may be directly uploaded to the application database factory 110. Moreover, the selection of a format for the collection data can depend on the type of data connection that the application database factory 110 has with the application server module 102. For a data connection via the Internet 108, the factory upload/download module 105 can use a markup language, for example, extensible markup language (XML), standard generalized markup language (SGML), or hypertext markup language (HTML), to format the collection data.

The collection data can be further processed prior to its upload to the application database factory 110. For example, check limit state 1814 and compression and encryption state 1816 can be performed to process the collection data prior to uploading to the application database factory 110. While these blocks may facilitate the upload of the collection data, they are not required to be performed. The collection data can be uploaded without applying states 1814 and 1816. In this way the process can follow alternate path 1813. Thus, the collection data can be directly uploaded to the application database factory 110 without applying states 1814 and 1816.

If further processing is desired, the process moves to a state 1814 where the factory upload/download module 105 can limit the collection data to a maximum size for uploading to the application database factory 110. For example, the collection data from a single workstation could be limited to a maximum of 20 megabytes. The process continues to a state 1816 where the collection data is compressed so that the collection data takes up less space. Further, the collection data is encrypted so that it is unreadable except by authorized users, for example, the application database factory 110.
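The optional states 1814 and 1816 can be sketched as follows, assuming the collection data is a list of byte-string records joined by newlines. The function name `limit_and_compress` and the reading of 20 megabytes as 20 * 1024 * 1024 bytes are assumptions. The sketch uses zlib for compression and omits the encryption step, because the specification does not name a cipher.

```python
import zlib
from typing import List

# The 20-megabyte example from the text, read here as binary megabytes.
MAX_UPLOAD_BYTES = 20 * 1024 * 1024


def limit_and_compress(records: List[bytes],
                       max_bytes: int = MAX_UPLOAD_BYTES) -> bytes:
    """Keep leading records up to max_bytes, then compress the result."""
    kept = []
    size = 0
    for record in records:
        needed = len(record) + 1  # one newline separator per record
        if size + needed > max_bytes:
            break  # state 1814: stop once the size limit would be exceeded
        kept.append(record)
        size += needed
    payload = b"".join(r + b"\n" for r in kept)
    return zlib.compress(payload)  # state 1816: compression only
```

A receiver would reverse the compression with `zlib.decompress` before merging the records.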

Flow continues to a state 1818 where the collection data is uploaded to the application database factory 110. As explained above, the collection data can include any additional data associated with the application, for example, destination port information. The process moves to a state 1820 where the upload/download module 301 continues with the download to the factory upload/download module 105. The process moves to a state 1822 where the downloaded data is stored in the application inventory database 103.

Returning to decision state 1808, if the send all network access application inventory flag is not activated, flow moves to state 1812 as described above. Since the send all network access application inventory flag was not activated, the factory upload/download module 105 formats the data retrieved at state 1806 for its upload to the application database factory 110 as described with reference to states 1812, 1814, 1816 and 1818.

Returning to decision state 1804, if the send all uncategorized applications network access flag was not activated, the process moves to decision state 1808 as described above where the factory upload/download module 105 determines if the send all network access application inventory flag has been activated. Depending on whether the send all network access application inventory flag was activated, the process then continues as described above.
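The two flag checks at decision states 1804 and 1808, followed by the optional formatting at state 1812, can be sketched as shown below. The function names and the XML element names `collection` and `application` are hypothetical. The sketch assumes each application is a flat mapping of field names to string values whose keys are valid XML element names.

```python
import xml.etree.ElementTree as ET
from typing import Dict, List


def collect_for_factory(uncategorized: List[Dict[str, str]],
                        inventory: List[Dict[str, str]],
                        send_all_uncategorized: bool,
                        send_all_inventory: bool) -> List[Dict[str, str]]:
    """Gather the application records selected by the two flags."""
    selected: List[Dict[str, str]] = []
    if send_all_uncategorized:  # decision state 1804 -> state 1806
        selected.extend(uncategorized)
    if send_all_inventory:      # decision state 1808 -> state 1810
        selected.extend(inventory)
    return selected


def format_as_xml(apps: List[Dict[str, str]]) -> str:
    """State 1812: format the collection data as XML."""
    root = ET.Element("collection")
    for app in apps:
        node = ET.SubElement(root, "application")
        for key, value in app.items():
            ET.SubElement(node, key).text = value
    return ET.tostring(root, encoding="unicode")
```

Because formatting is optional in the described process, a caller could upload the list returned by `collect_for_factory` directly.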

FIG. 19 is a flow diagram illustrating a process for downloading network access data from the application database factory to the application server module. The process begins at a state 1900. Next, at a decision state 1902, the application database factory 110 can download the master application database 300 to the application server module 102. If the application database factory 110 is to download the master application database 300 to the application server module 102, the process moves to a state 1904 where the upload/download module 301 extracts categorized applications from the master application database 300. A subset of the categorized applications can be selected for download to the application server module 102. The subset can include only categorized applications that have been deemed ready for publishing.

The process moves to a state 1906 where the application data retrieved from the master application database 300 can be formatted. The application data is not required to be formatted and thus may be directly downloaded to the application server module 102. Moreover, the selection of a format for the data can depend on the type of data connection that the application database factory 110 has with the application server module 102. For a data connection via the Internet 108, the upload/download module 301 can use a markup language, for example, XML, SGML, or HTML, to format the application data.

The data to be downloaded can be further processed prior to its download to the application server module 102. The process continues to a state 1908 where the application data is compressed so that the application data takes up less space. Further, the application data is encrypted so that it is unreadable except by authorized users, for example, the application server module 102. Flow continues to a state 1910 where the application data is downloaded to the application server module 102. The process then moves to state 1912 which is an end state.

Returning to decision state 1902, if application data from the master application database 300 is not being downloaded to the application server module 102, the process moves to a decision state 1914 where the application database factory 110 can receive an upload from the application server module 102. If the application database factory 110 is not to receive an upload from the application server module 102, the process moves to end state 1912.

Returning to decision state 1914, if the application database factory 110 is to receive an upload from the application server module 102, the process moves to a state 1916 where the upload/download module 301 receives the upload from the factory upload/download module 105. The time for receiving the collection data can be periodic, random, at a set time, or in response to polling. The upload/download module 301 and/or the factory upload/download module 105 can initiate the upload to the application database factory 110. As explained above, the collection data can include any additional data associated with the application, for example, a source IP address, destination IP address, source port number, destination port number and other network access data associated with the application from the application inventory database 103 and/or a source IP address, destination IP address, source port number, destination port number and other network access data associated with applications from the uncategorized application database 108 and/or the network access database 304. The collection data can be formatted or unformatted. Additionally, the collection data can be encrypted and/or compressed or not.

The process continues to a state 1918 where the upload/download module 301 decrypts and uncompresses the collection data if decryption and/or uncompression is required. The process moves to a state 1920 where the collection data is merged and sorted into the master application database 300 and the uncategorized application database 303 and/or the network access database 304. The process then continues to end state 1912.

FIG. 20 is a flow diagram illustrating a process for analyzing the network access data associated with an application at the application database factory. The process begins at start state 2000. The process moves to a state 2002 where a list of applications is extracted from the uncategorized application database 303 and/or the network access database 304 for classification by the human reviewer via the application analyst's classification module 302. The application analyst's classification module 302 interfaces with the human reviewer to determine the appropriate category or categories of the application. These categories may include “expected or predetermined network access” and “unexpected network access.” The category “expected or predetermined network access” could then be associated with a policy that allows the access to the network. The category “unexpected network access” could then be associated with a policy that does not allow access to the network.

Next, at a state 2004, the application analyst's classification module 302 is utilized to display the application and any related data on the GUI. The related data can indicate to the human reviewer the expected network activity for the application. As explained above, the application analyst's classification module 302 allows the human reviewer to analyze each application and any additional data that is associated with the application to determine an expected or allowed network activity.

The process continues to a state 2006 where the human reviewer analyzes the application, related information, and any Internet related information. The Internet information can be derived from a search using a web browser search engine. The application name and any of the related collection data can be used for the Internet search.

The expected or allowed network activity can be based upon prior or contemporaneous network activity for the same application. For example, the expected network activity for an application running on a first workstation 101 can be determined from a record of that application's prior activity on the first workstation. In addition or in the alternative, the expected network activity for an application is determined from a record of that application's prior activity on multiple workstations.

The expected network activity can be determined from network activity by a different but related application. For example, the programs or applications from a single software company may have common access privileges. The access privilege associated with a later version of an application may share common access privileges with an earlier version of the same application.

The network activity of the same application running on different workstations can be weighted in a predetermined manner to determine an expected network activity for the application. The expected network activity can determine a common access privilege for multiple workstations. The workstation management module stores the expected network activity in the hash/policy table 204. In a preferred embodiment, the network activity from multiple workstations is uploaded to the application database factory 110. The access privilege can be determined at the application database factory 110.
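The specification does not say how the activity from different workstations is weighted. One possible reading, offered only as a sketch, is a weighted vote over an observed attribute such as the destination port. The function name `expected_port`, the per-workstation weights, and the default weight of 1.0 are all assumptions.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple


def expected_port(observations: Iterable[Tuple[str, int]],
                  weights: Dict[str, float]) -> int:
    """Pick the destination port with the highest weighted vote.

    observations holds (workstation id, observed destination port) pairs.
    Workstations without an explicit weight count as 1.0.
    """
    scores: Dict[int, float] = defaultdict(float)
    for workstation_id, port in observations:
        scores[port] += weights.get(workstation_id, 1.0)
    if not scores:
        raise ValueError("no observations to weigh")
    return max(scores, key=scores.get)
```

The winning value could then serve as the common expected attribute stored for multiple workstations.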

The expected network activity for a given application can include one or more network attributes that are associated with the application. The attributes are associated with the application when the application accesses the network in an expected manner. These attributes can include, for example, a specific protocol, a specific I.P. address, and a specific access port. For example, the specific protocol for an application is listed in the hash/policy table 204. If the application requests access to the network using a different protocol than the expected protocol listed in the hash/policy table 204, the network access detection module 208 may disallow access.

An application may request access to the network multiple times in a single day. However, one or more of the network attributes associated with the application may be different for each attempted access. In this way, the attributes of the application may change over time. The network access detection module 208 may allow a first combination of one or more network attributes while disallowing a second combination of the one or more network attributes. The human reviewer can further review documents, specifications, manuals, and the like to best determine the expected or allowed behavior for the network requesting application.
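The attribute check described above can be sketched as a lookup of the requested attribute combination against the combinations listed for the application. The sketch assumes the hash/policy table 204 maps an application hash to a set of allowed (protocol, IP address, port) combinations. The names `NetworkAttributes` and `is_access_allowed`, and the choice to disallow applications that are not listed, are assumptions.

```python
from typing import Dict, NamedTuple, Set


class NetworkAttributes(NamedTuple):
    """One combination of network attributes for an access attempt."""
    protocol: str
    ip_address: str
    port: int


def is_access_allowed(app_hash: str,
                      request: NetworkAttributes,
                      hash_policy_table: Dict[str, Set[NetworkAttributes]]) -> bool:
    """Allow the request only if its attribute combination is expected."""
    allowed = hash_policy_table.get(app_hash)
    if allowed is None:
        return False  # assumption: unlisted applications are disallowed
    return request in allowed
```

Under this model the same application can be allowed over one protocol and disallowed over another, since each combination is checked as a whole.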

Each combination of the one or more attributes can be associated with one or more categories. The one or more categories can be further associated with the policies or rules for the workstation 101 and/or user.

The process continues to a state 2008 where the human reviewer determines the allowed behavior for the application using the evidence associated with the application, any hints from the related information, and/or other research. The process finally moves to a state 2010 where the allowed behavior for the network requesting application is stored in the master application database 300.

While the above detailed description has shown, described, and pointed out novel features of the invention as applied to various embodiments, it will be understood that various omissions, substitutions, and changes in the form and details of the device or process illustrated may be made by those skilled in the art without departing from the spirit of the invention. The scope of the invention is indicated by the appended claims rather than by the foregoing description. All changes which come within the meaning and range of equivalency of the claims are to be embraced within their scope. 

What is claimed is:
 1. A system, including one or more processors, for collecting network access data for use in updating a monitoring system which controls programs accessing a network, comprising: a workstation management module configured to detect a program on a workstation accessing a network, determine whether the program is in a network access database, send program data associated with the program to an application server module if the program is not in the network access database, and apply one or more policies that are associated with the program, wherein the network access database includes a protocol that is associated with the program; the application server module being configured to receive the program data from the workstation management module if the program was not in the network access database, determine whether the program is operating in a predetermined manner, wherein said predetermined manner means the program is operating in a manner determined by past network activity involving the same or relevant programs, if the program is not operating in a predetermined manner, then send the program data to an application database factory, if the program is operating in a predetermined manner, then provide the one or more policies associated with the program to the workstation management module, wherein the application server module is further configured to analyze the program data for a data characteristic that is indicative of whether the program is operating in the predetermined manner, and to associate one or more indicators with the program; and wherein analyzing the program data is performed on text strings that are associated with the program; a classification user interface configured to provide an interface for a network administrator to select the one or more policies that are associated with the program; and an upload/download manager module configured to send the program data to the application database factory and to receive the one or more policies from the application database factory.
 2. The system of claim 1 wherein the application database factory is configured to receive the program data from the application server module if the program is not operating in a predetermined manner, determine whether the program was previously analyzed by the application database factory, if the program was not previously analyzed, then determine one or more policies to associate with the program and provide the one or more policies to the application server module, if the program was previously analyzed, then provide the one or more policies that were previously associated with the program data to the application server module.
 3. The system of claim 1, wherein the protocol is a transport protocol.
 4. The system of claim 3, wherein the transport protocol is transmission control protocol (TCP).
 5. The system of claim 3, wherein the transport protocol is user datagram protocol (UDP).
 6. The system of claim 1, wherein the network access database comprises hash values.
 7. The system of claim 1, wherein the network access database comprises one or more categories and one or more policies associated with the program.
 8. The system of claim 1, wherein the workstation management module comprises an application digest generator configured to determine the program data to associate with the program.
 9. The system of claim 1, wherein the program data includes a source IP address.
 10. The system of claim 1, wherein the program data includes a destination IP address.
 11. The system of claim 1, wherein the one or more policies include allowing the program to access the network based on the one or more policies associated with the program and the user.
 12. The system of claim 1, wherein the one or more policies include not allowing the program to access the network based on the one or more policies associated with the program and the user.
 13. A system, including one or more processors, for collecting network access data for use in updating a monitoring system which controls a program on a computer from accessing a network based at least in part on information collected from another computer over the network, the system comprising: a first workstation management module configured to detect a program on a first workstation accessing a network, determine whether the program is in a first network access database, send program data associated with the program to an application server module if the program is not in the first network access database, and apply one or more policies that are associated with the program; the application server module being configured to receive the program data from the first workstation management module if the program was not in the first network access database, determine whether the program is operating in a predetermined manner, wherein said predetermined manner means the program is operating in a manner determined by past network activity involving the same or relevant programs, if the program is not operating in a predetermined manner, then send the program data to an application database factory, if the program is operating in a predetermined manner, then provide the one or more policies associated with the program to at least a second workstation; wherein the application server module is further configured to analyze the program data for a data characteristic that is indicative of whether the program is operating in the predetermined manner, and to associate one or more indicators with the program; and wherein analyzing the program data is performed on text strings that are associated with the program; and a second workstation management module being configured to receive the one or more policies from the application server module and update a second network access database resident on the second workstation.
 14. The system of claim 13, wherein the one or more indicators includes a category flag.
 15. The system of claim 13, wherein the application server module uses the one or more indicators to screen the program prior to sending the program data to the application database factory.
 16. A system, including one or more processors, for collecting network access data for use in updating a monitoring system which controls programs accessing a network, comprising: a workstation management module configured to detect a program on a workstation accessing a network, determine whether the program is in a network access database, send program data associated with the program to an application server module if the program is not in the network access database, and apply one or more policies that are associated with the program, wherein the network access database includes a protocol that is associated with the program; the application server module being configured to receive the program data from the workstation management module if the program was not in the network access database, analyze the program data for a data characteristic that is indicative of whether the program is operating in a predetermined manner and to associate one or more indicators with the program, wherein said predetermined manner means the program is operating in a manner determined by past network activity involving the same or relevant programs, if the program is not operating in a predetermined manner, then send the program data and the data characteristic to an application database factory, if the program is operating in a predetermined manner, then provide the one or more policies associated with the program to the workstation management module; and wherein the application server module is further configured to analyze the program data for a data characteristic that is indicative of whether the program is operating in the predetermined manner, and to associate one or more indicators with the program; and wherein analyzing the program data is performed on text strings that are associated with the program.
 17. The system of claim 16, wherein the one or more indicators includes a category flag. 